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SECRETED PROTEINS AND USES THEREOF 

This application claims priority to co-pending U.S. Application No. 
09/342,687, filed June 29, 1999, the entire contents of which are incorporated herein 
by reference in its entirety. 

5 

Background of the Invention 

Many secreted proteins, for example, cytokines, play a vital role in the 
regulation of cell growth, cell differentiation, and a variety of specific cellular 
responses, A number of medically useful proteins, including erythropoietin, 

10 granulocyte-macrophage colony stimulating factor, human growth hormone, and 
various interleukins, are secreted proteins. Thus, an important goal in the design and 
development of new therapies is the identification and characterization of membrane- 
associated and secreted proteins and the genes which encode them. 

Many membrane-associated proteins are receptors which bind a ligand and 

15 transduce an intracellular signal, leading to a variety of cellular responses. The 

identification and characterization of such a receptor enables one to identify both the 
ligands which bind to the receptor and the intracellular molecules and signal 
transduction pathways associated with the receptor, permitting one to identify or 
design modulators of receptor activity, e.g., receptor agonists or antagonists and 

20 modulators of signal transduction. 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of cDNA 
molecules encoding TANGO 244, TANGO 246, TANGO 275, TANGO 300, and 

25 MANGO 245, all of which are predicted to be either wholly secreted or 

transmembrane proteins. These proteins, fragments, derivatives, and variants thereof 
are collectively referred to as a "polypeptides of the invention" or "proteins of the 
invention." Nucleic acid molecules encoding the polypeptides or proteins of the 
invention are collectively referred to as "nucleic acids of the invention " 

30 The nucleic acids and polypeptides of the present invention are useftil as 

modulating agents in regulating a variety of cellular processes. Accordingly, in one 

1 
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aspect, this invention provides isolated nucleic acid molecules encoding a polypeptide 
of the invention or a biologically active portion thereof. The present invention also 
provides nucleic acid molecules which are suitable for use as primers or hybridization 
probes for the detection of nucleic acids encoding a polypeptide of the invention. 



55%, 65%, 75%, 85%, 95%, or 98%) identical to the nucleotide sequence of SEQ ID 
NOs:l,3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, the nucleotide sequence 
of the cDNA insert of a clone deposited with ATCC® as Accession Number 207220 
(the "cDNA of ATCC® Accession Number 207220"), the nucleotide sequence of the 

10 cDNA insert of a clone deposited with ATCC® as Accession Number 207223 (the 
"cDNA of ATCC® Accession Number 207223"), the nucleotide sequence of the 
cDNA insert of a clone deposited with ATCC® as Accession Number PTA-248 (the 
"cDNA of ATCC® Accession Number PTA-248"), or the nucleotide sequence of the 
cDNA insert of a clone deposited with ATCC® as Accession Number PTA-293 (the 

1 5 "cDNA of ATCC® Accession Number PTA-293"). 



at least 300 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 
1200, 1400, 1600, 1800, 2000, 2400, 2600, 2800, 3000, 3200, 3400, 3600, 3800, or 
4000) nucleotides of the nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 

20 13, 15, 16, 18, 19, 22, 24, or 25, the nucleotide sequence of the cDNA of ATCC® 
Accession Number 207220, the nucleotide sequence of the cDNA of ATCC® 
Accession Number 207223, the nucleotide sequence of the cDNA of ATCC® 
Accession Number PTA-248, or the nucleotide sequence of the cDNA of ATCC® 
Accession Number PTA-293, or a complement thereof. 

25 The invention also features nucleic acid molecules which include a 

nucleotide sequence encoding a protein having an amino acid sequence that is at least 
45% (or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the amino acid sequence of 
SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid sequence encoded by 
the cDNA of ATCC® Accession Number 207220, the amino acid sequence encoded 

30 by the cDNA of ATCC® Accession Number 207223, the amino acid sequence 



5 



The invention features nucleic acid molecules which are at least 45% (or 



The invention features nucleic acid molecules which include a fragment of 
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encoded by the cDNA of ATCC® Accession Number PTA-248, or the amino acid 
sequence encoded by the cDNA of ATCC® Accession Number PTA-293. 

In preferred embodiments, the nucleic acid molecules have the nucleotide 
sequence of SEQ ID NOs: 1,3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, the 
5 nucleotide sequence of the cDNA of ATCC® Accession Number 207220, the 
nucleotide sequence of the cDNA of ATCC® Accession Number 207223, the 
nucleotide sequence of the cDNA of ATCC® Accession Number PTA-248, or the 
nucleotide sequence of the cDNA of ATCC® Accession Number PTA-293, or a 
complement thereof. 

10 Also within the invention are nucleic acid molecules which encode a 

fragment of a polypeptide having the amino acid sequence of SEQ ID N0s:2, 5,8,11, 
14, 17, 20, 23, or 91, or a fragment including at least 15 (25, 30, 50, 100, 150, 300, 
400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, or 1400) contiguous amino 
acids of SEQ ID NOs:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, the amino acid sequence 

15 encoded by the cDNA of ATCC® Accession Number 207220, the amino acid 
sequence encoded by the cDNA of ATCC® Accession Number 207223, the amino 
acid sequence encoded by the cDNA of ATCC® Accession Number PTA-248, or the 
amino acid sequence encoded by the cDNA of ATCC® Accession Number PTA-293. 
The invention includes nucleic acid molecules which encode a naturally 

20 occurring allelic variant of a polypeptide comprising the amino acid sequence of SEQ 
ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, the amino acid sequence encoded by the 
cDNA of ATCC® Accession Number 207220, the amino acid sequence encoded by 
the cDNA of ATCC® Accession Number 207223, the amino acid sequence encoded 
by the cDNA of ATCC® Accession Number PTA-248, or the amino acid sequence 

25 encoded by the cDNA of ATCC® Accession Number PTA-293, wherein the nucleic 
acid molecule hybridizes to a nucleic acid molecule consisting of a nucleic acid 
sequence encoding SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, the nucleotide 
sequence of the cDNA of ATCC® Accession Number 207220, the nucleotide 
sequence of the cDNA of ATCC® Accession Number 207223, the nucleotide 

30 sequence of the cDNA of ATCC® Accession Number PTA-248, or the nucleotide 
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sequence of the cDNA of ATCC® Accession Number PTA-293, or a complement 
thereof under stringent conditions. 

Also within the invention are isolated polypeptides or proteins having an 
amino acid sequence that is at least about 60%, preferably 65%, 75%, 85%, 95%, or 
5 98% identical to the amino acid sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, 
or 91, the amino acid sequence encoded by the cDNA of ATCC® Accession Number 
207220, the amino acid sequence encoded by the cDNA of ATCC® Accession 
Number 207223, the amino acid sequence encoded by the cDNA of ATCC® 
Accession Number PTA-248, or the amino acid sequence encoded by the cDNA of 

10 ATCC® Accession Number PTA-293. 

Also within the invention are isolated polypeptides or proteins which are 
encoded by a nucleic acid molecule having a nucleotide sequence that is at least about 
60%, preferably 65%, 75%, 85%), or 95% identical the nucleic acid sequence encoding 
SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, and isolated polypeptides or proteins 

15 which are encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule having 
the nucleotide sequence of SEQ ID NOs: 1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 
24, or 25, or complement thereof, the non-coding strand of the cDNA of ATCC® 
Accession Number 207220, the non-coding strand of the cDNA of ATCC® Accession 

20 Number 207223, the non-coding strand of the cDNA of ATCC® Accession Number 
PTA-248, or the non-coding strand of the cDNA of ATCC® Accession Number PTA- 
293. 

Also within the invention are polypeptides which are naturally occurring 
allelic variants of a polypeptide that includes the amino acid sequence of SEQ ID 

25 N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid sequence encoded by the cDNA 
of ATCC® Accession Number 207220, the amino acid sequence encoded by the 
cDNA of ATCC® Accession Number 207223, the amino acid sequence encoded by 
the cDNA of ATCC® Accession Number PTA-248, or the amino acid sequence 
encoded by the cDNA of ATCC® Accession Number PTA-293, wherein the 

30 polypeptide is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule having the sequence ofSEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 

4 
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19, 22, 24, or 25, or a complement thereof, under stringent conditions. Such allelic 
variant differ at 1%, 2%, 3%, 4%, or 5% of the amino acid residues. 

The invention also features nucleic acid molecules that hybridize under 
stringent conditions to a nucleic acid molecule having the nucleotide sequence of SEQ 
5 ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, the cDNA of ATCC® 
Accession Number 207220, the cDNA of ATCC® Accession Number 207223, the 
cDNA of ATCC® Accession Number PTA-248, or the cDNA of ATCC® Accession 
Number PTA-293, or a complement thereof In other embodiments, the nucleic acid 
molecules are at least 300 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

10 800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3200, 
3400, 3600, 3800, 4000, or 4200) nucleotides in length and hybridize under stringent 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID 
NOs: 1,3,4, 6,7,9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, the cDN A of ATCC® 
Accession Number 207220, the cDNA of ATCC® Accession Number 207223, the 

1 5 cDNA of ATCC® Accession Number PTA-248, or the cDNA of ATCC® Accession 
Number PTA-293, or a complement thereof 

In other embodiments, the isolated nucleic acid molecules encode an 
extracellular, transmembrane, or cytoplasmic domain of a polypeptide of the 
invention. 

20 In another embodiment, the invention provides an isolated nucleic acid 

molecule which is antisense to the coding strand of a nucleic acid of the invention. 

Another aspect of the invention provides vectors, e.g., recombinant 
expression vectors, comprising a nucleic acid molecule of the invention. In another 
embodiment, the invention provides host cells containing such a vector or a nucleic 

25 acid molecule of the invention. The invention also provides methods for producing a 
polypeptide of the invention by culturing, in a suitable medium, a host cell of the 
invention containing a recombinant expression vector such that a polypeptide is 
produced. 

Another aspect of this invention features isolated or recombinant proteins 
30 and polypeptides of the invention. Preferred proteins and polypeptides possess at 
least one biological activity possessed by the corresponding naturally-occurring 

5 
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human polypeptide. An activity, a biological activity, or a functional activity of a 
polypeptide or nucleic acid of the invention refers to an activity exerted by a protein, 
polypeptide or nucleic acid molecule of the invention on a responsive cell as 
determined in vivo, or in vitro, according to standard techniques. Such activities can 
5 be a direct activity, such as an association with or an enzymatic activity on a second 
protein or an indirect activity, such as a cellular signaling activity mediated by 
interaction of the protein with a second protein. 

For TANGO 244, biological activities include, e.g., (1) the ability to form 
protein-protein interactions with proteins in the signaling pathway of the naturally- 

10 occurring polypeptide; (2) the ability to bind a ligand of the naturally-occurring 
polypeptide; and (3) the ability to interact with a TANGO 244 receptor. Other 
activities include the ability to modulate function, survival, morphology, proliferation 
and/or differentiation of cells of tissues in which it is expressed. 

For TANGO 246, biological activities include, e.g., (1) the ability to form 

15 protein-protein interactions with proteins in the signaling pathway of the naturally- 
occurring polypeptide; (2) the ability to bind a ligand of the naturally-occurring 
polypeptide; and (3) the ability to interact with a TANGO 246 receptor. Other 
activities include the ability to modulate function, survival, morphology, proliferation 
and/or differentiation of cells of tissues in which it is expressed. TANGO 246 

20 biological activities can include the ability to act as a small molecule transporter or a 
cell cycle regulator. 

For TANGO 275, biological activities include, e.g., (1) the ability to form 
protein-protein interactions with proteins in the signaling pathway of the naturally- 
occurring polypeptide; (2) the ability to bind a ligand of the naturally-occurring 

25 polypeptide; and (3) the ability to interact with a TANGO 275 receptor. Other 

activities include the ability to modulate function, survival, morphology, proliferation 
and/or differentiation of cells of tissues in which it is expressed {e.g., cells of the 
pituitary gland). TANGO 275 biological activities can include: (1) the ability to act 
as a TGF-P binding protein; (2) the ability to facilitate the normal assembly and 

30 secretion of large latent complexes containing TGF-P; (3) the ability to target latent 
TGF-P to connective tissue; (4) the ability to target latent TGF-p to the cell surface; 

6 
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(5) the ability to modulate bone formation, renewal, or remodelling; and (6) the ability 
to modulate the development or function of the heart, cardiovascular system, brain, 
placenta, liver, skeletal muscle, kidney or pancreas. 



5 protein-protein interactions with proteins in the signaling pathway of the naturally- 
occurring polypeptide; (2) the ability to bind a ligand of the naturally-occurring 
polypeptide; (3) the ability to interact with a TANGO 300 receptor; and (4) the ability 
to mediate an intracellular signal Other activities include the ability to modulate 
function, survival, morphology, proliferation and/or differentiation of cells of tissues 
1 0 in which it is expressed. 



protein-protein interactions with proteins in the signaling pathway of the naturally- 
occurring polypeptide; (2) the ability to bind a ligand of the naturally-occurring 
polypeptide; and (3) the ability to interact with a MANGO 245 receptor. Other 

15 activities include the ability to modulate function, survival, morphology, proliferation 
and/or differentiation of cells of tissues in which it is expressed, e.g., the central 
nervous system, and the ability to modulate the cellular functions of cells of the 
nervous system (neurons and glial cells), and the ability to act as a modulator of 
complement function. 

20 In one embodiment, a polypeptide of the invention has an amino acid 

sequence sufficiently identical to an identified domain of a polypeptide of the 
invention. As used herein, the term "sufficiently identical" refers to a first amino acid 
or nucleotide sequence which contains a sufficient or minimum number of identical or 
equivalent (e.g., with a similar side chain) amino acid residues or nucleotides to a 

25 second amino acid or nucleotide sequence such that the first and second amino acid or 
nucleotide sequences have a common structural domain and/or common functional 
activity. For example, amino acid or nucleotide sequences which contain a common 
structural domain having about 60% identity, preferably 65% identity, more 
preferably 75%, 85%, 95%, 98% or more identity are defined herein as sufficiently 

30 identical. 



For TANGO 300, biological activities include, eg., (1) the ability to form 



For MANGO 245, biological activities include, e.g., (1) the ability to form 
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In one embodiment, a TANGO 244, TANGO 246, TANGO 275, TANGO 
300, or MANGO 245 polypeptide of the invention includes a signal peptide. 

In another embodiment, a nucleic acid molecule of the invention encodes a 
TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 245 polypeptide 
5 which includes a signal peptide. In another embodiment, a TANGO 244, TANGO 
246, or MANGO 245 polypeptide of the invention also includes one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) 
a cytoplasmic domain. 

In one embodiment, the isolated polypeptide of the invention lacks both a 
10 transmembrane and a cytoplasmic domain. In another embodiment, the polypeptide 
lacks both a transmembrane domain and a cytoplasmic domain and is soluble under 
physiological conditions. 

The polypeptides of the present invention, or biologically active portions 
thereof can be operably linked to a heterologous amino acid sequence to form fusion 
15 proteins. In one embodiment, the fusion protein consists of a chimeric protein 
assembled from portions of the protein from different species. In another 
embodiment, the fusion protein consists of the amino terminal portion of murine 
MANGO 245 attached to the carboxy terminal portion of human MANGO 245. 

The invention further features antibodies that specifically bind a 
20 polypeptide of the invention such as monoclonal or polyclonal antibodies. In 

addition, the polypeptides of the invention or biologically active portions thereof, or 
antibodies of the invention, can be incorporated into pharmaceutical compositions, 
which optionally include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides methods for detecting the 
25 presence of the activity or expression of a polypeptide of the invention in a biological 
sample by contacting the biological sample with an agent capable of detecting an 
indicator of activity such that the presence of activity is detected in the biological 
sample. 

In another aspect, the invention provides methods for modulating activity 
30 of a polypeptide of the invention comprising contacting a cell with an agent that 
modulates (inhibits or stimulates) the activity or expression of a polypeptide of the 

8 
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invention such that activity or expression in the cell is modulated. In one 
embodiment, the agent is an antibody that specifically binds to a polypeptide of the 
invention. 

In another embodiment, the agent modulates expression of a polypeptide 
5 of the invention by modulating transcription, splicing, or translation of an mRNA 
encoding a polypeptide of the invention. In yet another embodiment, the agent is a 
nucleic acid molecule having a nucleotide sequence that is antisense to the coding 
strand of an mRNA encoding a polypeptide of the invention. 

The present invention also provides methods to treat a subject having a 
10 disorder characterized by aberrant activity of a polypeptide of the invention or 

aberrant expression of a nucleic acid of the invention by administering an agent which 
is a modulator of the activity of a polypeptide of the invention or a modulator of the 
expression of a nucleic acid of the invention to the subject. In one embodiment, the 
modulator is a protein of the invention. In another embodiment, the modulator is a 
15 nucleic acid of the invention. In other embodiments, the modulator is a peptide, 
peptidomimetic, or other small organic molecule. 

The present invention also provides diagnostic assays for identifying the 
presence or absence of a genetic lesion or mutation characterized by at least one of: 
(i) aberrant modification or mutation of a gene encoding a polypeptide of the 
20 invention, (ii) mis-regulation of a gene encoding a polypeptide of the invention, and 
(iii) aberrant post-translational modification of the invention wherein a wild-type form 
of the gene encodes a protein having the activity of the polypeptide of the invention. 

In another aspect, the invention provides a method for identifying a 
compound that binds to or modulates the activity of a polypeptide of the invention. In 
25 general, such methods entail measuring a biological activity of the polypeptide in the 
presence and absence of a test compound and identifying those compounds which 
alter the activity of the polypeptide. 

The invention also features methods for identifying a compound which 
modulates the expression of a polypeptide or nucleic acid of the invention by 
30 measuring the expression of the polypeptide or nucleic acid in the presence and 
absence of the compound. 

9 
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In yet a further aspect, the invention provides substantially purified 
antibodies or fragments thereof including human and non-human antibodies or 
fragments thereof which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of: the amino 
5 acidsequenceofSEQIDNOs:2, 5, 8, 11, 14, 17, 20, 23 or 91, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207220, 207223, PTA-248 or PTA-293; a fragment of at least 15 
amino acid residues of the amino acid sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 
20, 23, or 91; an amino acid sequence which is at least 95% identical to the amino 

10 acid sequence ofSEQ ID N0:2, 5,8, 11, 14, 17, 20, 23, or 91, wherein the percent 
identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 or 
the BESTFIT program with BLOSUM 62 scoring matrix, gap open penalty of 12, 
frame shift penalty of 5, gap extend penalty of 4; and an amino acid sequence which is 

15 encoded by a nucleic acid molecule which hybridizes to the nucleic acid molecule 
consisting of SEQ ID:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, under 
conditions of hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS 
at 65°C. In various embodiments, the substantially purified antibodies of the 
invention, or fragments thereof can be human, non-human, chimeric and/or 

20 humanized antibodies. 

Any of the antibodies of the invention can be conjugated to a therapeutic 
moiety or to a detectable substance. Non-limiting examples of detectable substances 
that can be conjugated to the antibodies of the invention are an enzyme, a prosthetic 
group, a fluorescent material, a luminescent material, a bioluminescent material, and a 

25 radioactive material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of 
the invention is a pharmaceutical composition comprising an antibody of the 
invention and a pharmaceutically acceptable carrier. In preferred embodiments, the 

30 pharmaceutical composition contains an antibody of the invention, a therapeutic 
moiety, and a pharmaceutically acceptable carrier. 

10 
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Other features and advantages of the invention will be apparent from the 
following detailed description and Claims. 

Brief Description of the Drawings 

5 Figure 1 depicts the cDNA sequence (SEQ ID NO: 1 ) and the predicted 

amino acid sequence (SEQ ID N0:2) of human TANGO 244. The open reading 
frame of SEQ ID N0:1 extends from nucleotide 85 to nucleotide 570 of SEQ ID 
N0:1 (SEQ ID NO:3), 

Figure 2 depicts a hydropathy plot of human TANGO 244. Relatively 

10 hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 
regions are below the dashed horizontal line. The cysteine residues (cys) are indicated 
by short vertical lines just below the hydropathy trace. 

Figure 3 depicts an alignment of the immunoglobulin domain of human 
TANGO 244 (SEQ ID NO:28) with a consensus hidden Markov model 

15 immunoglobulin domain (SEQ ID NO:29). In the consensus sequence, more 

conserved residues are indicated by uppercase letters, and less conserved residues are 
indicated by lowercase letters. A within a sequence indicates a gap created in the 
sequence for purposes of alignment. A between the aligned sequences indicates a 
conservative amino acid difference. 

20 Figure 4 depicts an ahgnment of the amino acid sequence of human 

TANGO 244 (SEQ ID N0:2) and the amino acid sequence of human CTH (SEQ ID 
N0:81; Genbank Accession Number AF061022; Marcuz et al., Eur J. Immunol. 
28:4094-4104). This alignment was created using ALIGN (version 2.0; PAM120 
scoring matrix; gap length penalty of 12; gap penalty of 4), In this alignment, the 

25 sequences are 48.6% identical. 

Figures 5A-5B depict the cDNA sequence (SEQ ID NO:4) and the 
predicted amino acid sequence (SEQ ID N0:5) of human TANGO 246. The open 
reading frame of SEQ ED N0:4 extends from nucleotide 94 to nucleotide 1080 of 
SEQ ID N0:4 (SEQ ID N0:6). 

30 Figure 6 depicts a hydropathy plot of human TANGO 246. Relatively 

hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 

11 
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regions are below the clashed horizontal line. The cysteine residues (cys) and 
potential N-glycosylation sites (Ngly) are indicated by short vertical lines just below 
the hydropathy trace. 

Figure 7 depicts an alignment of the cell cycle protein domain of human 
5 TANGO 246 (SEQ ID NO:30) with a consensus hidden Markov model cell cycle 
protein domain (SEQ ID N0:3 1). In the consensus sequence, more conserved 
residues are indicated by uppercase letters, and less conserved residues are indicated 
by lowercase letters. A within a sequence indicates a gap created in the sequence 
for purposes of alignment. A between the aligned sequences indicates a 

10 conservative amino acid difference. 

Figure 8 depicts an alignment of the ABC transporter domain of human 
TANGO 246 (SEQ ID NO:32) with a consensus hidden Markov model ABC 
transporter domain (SEQ ID NO:33). In the consensus sequence, more conserved 
residues are indicated by uppercase letters, and less conserved residues are indicated 

15 by lowercase letters. A within a sequence indicates a gap created in the sequence 
for purposes of alignment. A between the aligned sequences indicates a 
conservative amino acid difference. 

Figures 9A-9D depict the cDNA sequence (SEQ ID NO: 7) and the 
predicted amino acid sequence (SEQ ID N0:8) of human TANGO 275. The open 

20 reading frame of SEQ ED N0:7 extends from nucleotide 23 to nucleotide 3931 SEQ 
ID N0:7 (SEQ ID NO:9). 

Figure 70 depicts a hydropathy plot of human TANGO 275. Relatively 
hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 
regions are below the dashed horizontal line. The cysteine residues (cys) and 

25 potential N-glycosylation sites (Ngly) are indicated by short vertical lines just below 
the hydropathy trace. 

Figures llA-UB depict alignments of the EGF-like domains of human 
TANGO 275 (SEQ ID NOs:34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47 and 
48) with a consensus hidden Markov model EGF-like domain (SEQ ID NO:49). In 

30 the consensus sequence, more conserved residues are indicated by uppercase letters, 
and less conserved residues are indicated by lowercase letters. A within a 
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sequence indicates a gap created in the sequence for purposes of alignment. A 
between the aligned sequences indicates a conservative amino acid difference. 

Figure 12 depicts alignments of the TB domains of human TANGO 275 
(SEQ ID NOs:50, 51, 52, and 53) with a consensus hidden Markov model TB domain 
5 (SEQ ID NO:54). In the consensus sequence, more conserved residues are indicated 
by uppercase letters, and less conserved residues are indicated by lowercase letters. A 

within a sequence indicates a gap created in the sequence for purposes of 
alignment. A between the aligned sequences indicates a conservative amino acid 
difference. 

10 Figure 13 depicts alignments of the metallothionein domain of human 

TANGO 275 (SEQ ID NO:55) with a consensus hidden Markov model 
metallothionein domain (SEQ ID NO:56). In the consensus sequence, more 
conserved residues are indicated by uppercase letters, and less conserved residues are 
indicated by lowercase letters. A within a sequence indicates a gap created in the 

15 sequence for purposes of alignment. A between the aligned sequences indicates a 
conservative amino acid difference. 

Figures 14A-14H depict an alignment of the nucleotide sequence of human 
TANGO 275 (SEQ ID NO: 7) and the nucleotide sequence of murine LTBP-3 
(Genbank Accession Number L40459; SEQ ID NO:82). This alignment was created 

20 using ALIGN (version 2.0; PAMl 20 scoring matrix; gap length penalty of 1 2; gap 
penalty of 4). In this alignment, the sequences are 77.1% identical. 

Figures 15A-15C depict an alignment of the amino acid sequence of 
human TANGO 275 (SEQ ID N0:8) and the amino acid sequence of murine LTBP-3 
(GENSEQO Accession Number R79475; SEQ ID NO:83). This alignment was 

25 created using ALIGN (version 2.0; PAMl 20 scoring matrix, gap length penalty of 12; 
gap penalty of 4). In this alignment, the sequences are 82.8% identical. 

Figures 16A-16G depict the cDNA sequence (SEQ ID NO: 10) and the 
predicted amino acid sequence (SEQ ED N0:1 1) of murine TANGO 275. The open 
reading frame of SEQ ID NO: 10 extends from nucleotide 157 to nucleotide 3916 of 

30 SEQIDNO:10(SEQIDNO:12). 
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Figure 1 7A-1 7B depicts the cDNA sequence (SEQ ID NO: 1 3) and the 
predicted amino acid sequence (SEQ ID NO: 14) of human TANGO 300. The open 
reading frame of SEQ ID NO: 13 extends from nucleotide 31 to nucleotide 1 1 13 of 
SEQ ID NO: 13 (SEQ ID NO: 15). 
5 Figure 18 depicts a hydropathy plot of human TANGO 300. Relatively 

hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 
regions are below the dashed horizontal line. The cysteine residues (cys) and 
potential N-glycosylation sites (Ngly) are indicated by short vertical lines just below 
the hydropathy trace. 

10 Figures I9A-19C depicts the cDNA sequence (SEQ ID NO: 16) and the 

predicted amino acid sequence (SEQ ID N0:17) of murine TANGO 300. The open 
reading frame of SEQ ID NO: 16 extends from nucleotide 41 to nucleotide 1 195 of 
SEQ ID NO: 16 (SEQ ID NO: 18). 

Figure 20 depicts a hydropathy plot of murine TANGO 300. Relatively 

15 hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 
regions are below the dashed horizontal line. The cysteine residues (cys) and 
potential N-glycosylation sites (Ngly) are indicated by short vertical lines just below 
the hydropathy trace. 

Figures 21A-21B depict an alignment of the nucleotide sequence of the 

20 ORE of human TANGO 300 (SEQ ID NO: 15) and the nucleotide sequence of the 
ORE of murine TANGO 300 (SEQ ID NO: 18). This alignment was created using 
BESTFIT (BLOSUM 62 scoring matrix; gap open penalty of 12; frame shift penalty 
of 5; gap extend penalty of 4). In this alignment, the sequences are 77.7% identical. 
Figure 22 depicts an aligimient of the amino acid sequence of human 

25 TANGO 300 (SEQ ID NO:14) and the amino acid sequence of murine TANGO 300 
(SEQ ID NO: 1 7). This alignment was created using BESTFIT (BLOSUM 62 scoring 
matrix; gap open penalty of 12; frame shift penalty of 5; gap extend penalty of 4). In 
this alignment, the sequences are 69.6% identical. 

Figures 23A-23B depicts the cDNA sequence (SEQ ID NO: 19) and the 

30 predicted amino acid sequence (SEQ ID NO:20) of human MANGO 245. The open 
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reading frame of SEQ ID NO: 19 extends from nucleotide 105 to nucleotide 11 48 of 
SEQ ID NO: 19 (SEQ ID N0:21). 

Figure 24 depicts a hydropathy plot of human MANGO 245. Relatively 
hydrophobic regions are above the dashed horizontal line, and relatively hydrophilic 
5 regions are below the dashed horizontal line. The cysteine residues (cys) and 

potential N-glycosylation sites (Ngly) are indicated by short vertical lines just below 
the hydropathy trace. 

Figures 25A-25B depict the cDNA sequence (SEQ ID NO:22) and the 
predicted amino acid sequence (SEQ ID NO:23) of monkey MANGO 245. The open 
10 reading frame of SEQ ID NO:22 extends from nucleotide 250 to nucleotide 1236 of 
SEQ ID NO:22 (SEQ ID NO:24). 

Figure 26 depicts an alignment of the amino acid sequences of human 
MANGO 245 (SEQ ID NO:20) and monkey MANGO 245 (SEQ ID NO:23). This 
alignment was created using ALIGN (version 2.0; PAM120 scoring matrix, gap 
15 length penalty of 12; gap penalty of 4). In this alignment, the sequences are 84.8% 
identical. 

Figure 27 depicts alignments of the Clq domains of human MANGO 245 
(SEQ ID NOs:70 and 71) with a consensus hidden Markov model Clq domain (SEQ 
ID NO:72). In the consensus sequence, more conserved residues are indicated by 
20 uppercase letters, and less conserved residues are indicated by lowercase letters. A 
" within a sequence indicates a gap created in the sequence for purposes of alignment. 
A "H-" between the aligned sequences indicates a conservative amino acid difference. 

Figure 28 depicts alignments of the Clq domains of monkey MANGO 245 
(SEQ ED NOs:73 and 74) with a consensus hidden Markov model Clq domain (SEQ 
25 ID NO: 72). In the consensus sequence, more conserved residues are indicated by 
uppercase letters, and less conserved residues are indicated by lowercase letters. A 

within a sequence indicates a gap created in the sequence for purposes of 
alignment. A "-h" between the aligned sequences indicates a conservative amino acid 
difference. 

30 Figure 29 depicts the cDNA sequence (SEQ ID NO:25) of murine 

MANGO 245 and the predicted amino acid sequence (SEQ ID N0:91) of murine 
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MANGO 245. The open reading frame of SEQ ID NO:25 extends from nucleotide 29 
to nucleotide 625 of SEQ ID NO:25 (SEQ ID NO:92). 

Figures 30A-30B depict an alignment of nucleotide 51 to nucleotide 748 of 
human MANGO 245 (SEQ ID NO: 19) with murine MANGO 245 (SEQ ID NO:25)- 
5 This alignment was created using BESTFIT (BLOSUM 62 scoring matrix; gap open 
penalty of 12; frame shift penalty of 5; gap extend penalty of 4). In this alignment, 
the sequences are 89.6% identical. 

Figure 31 depicts an alignment of the amino acid sequence of human 
TANGO 246 (SEQ ID N0:5) and the amino acid sequence of Arabidopsis thaliana 
10 AIGl (Genbank Accession Number AAC49289; SEQ ID NO:87). 

Figure 32A-32B depicts an alignment of the amino acid sequence of 
murine TANGO 275 (SEQ ID NO: 1 1) and the amino acid sequence of murine LTBP- 
3 (GENSEQO Accession Number R79475; SEQ ID NO:83). This alignment was 
created using ALIGN (version 2.0; PAM120 scoring matrix, gap length penalty of 12; 
15 gap penalty of 4). In this ahgrmient, the sequences are 97.4% identical. 

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of cDNA 
molecules encoding TANGO 244, TANGO 246, TANGO 275, TANGO 300, and 

20 MANGO 245, all of which are predicted to be either wholly secreted or 
transmembrane proteins. 

The proteins and nucleic acid molecules of the present invention comprise 
a family of molecules having certain conserved structural and ftmctional features. As 
used herein, the term "family*' is intended to mean two or more proteins or nucleic 

25 acid molecules having a common structural domain and having sufficient amino acid 
or nucleotide sequence identity as defined herein. Family members can be from either 
the same or different species. For example, a family can comprise two or more 
proteins of human origin, or can comprise one or more proteins of human origin and 
one or more of non-human origin. For example, the human MANGO 245 and 

30 monkey MANGO 245 genes described herein are both members of the MANGO 245 
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family. Two different polypeptides encoded by splice variants of a given transcript 
are also considered members of the same family. 

Members of the same family may also have common structural domains. 
For example, a TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 
5 245 family member includes a signal peptide. As used herein, a "signal peptide" 
includes a peptide of at least about 1 5 amino acid residues in length w^hich occurs at 
the N-terminus of secretory and membrane-bound proteins and which contains at least 
about 70% hydrophobic amino acid residues such as alanine, leucine, isoleucine, 
phenylalanine, proline, tyrosine, tryptophan, or valine. The sequence can contain 

10 about 15 to 45 amino acid residues or about 17 to 22 amino acid residues, and has at 
least about 60-80%, 65-75%, or about 70% hydrophobic residues. A signal peptide 
serves to direct a protein containing such a sequence to a lipid bilayer. 

Thus, in one embodiment, a TANGO 244 protein contains a signal peptide 
of about amino acids 1 to 26 (1 to 24, 1 to 25, 1 to 27, or 1 to 28) of SEQ ID N0:2 

15 (SEQ ID NO:26). In one embodiment, a TANGO 275 protein contains a signal 
peptide of about amino acids 1 to 29 (1 to 27, 1 to 28, 1 to 30, 1 to 3 1) of SEQ ID 
N0:8 (SEQ ID NO:60). In one embodiment, a TANGO 300 protein contains a signal 
peptide of about amino acids 1 to 19 (1 to 17, I to 18, 1 to 20, 1 to 21) of SEQ ID 
NO: 14 or SEQ ID N0:17 (SEQ ID NO:62 and SEQ ID NO:64, respectively). In one 

20 embodiment, a MANGO 245 protein contains a signal peptide of amino acids 1 to 16 
(1 to 14, 1 to 15, 1 to 17, 1 to 18) of SEQ ID NO:20 or SEQ ID NO:23 (SEQ ID 
NO:66 and SEQ ID NO:68, respectively). 

The signal peptide is cleaved during processing of the mature protein. 
Sometimes the initial methionine residue is also cleaved from the protein during 

25 signal peptide processing. Thus, in one embodiment, a TANGO 244 protein does not 
contain a signal peptide or an initial methionine residue and begins from residue 2 of 
SEQ ED N0:2. In one embodiment, a TANGO 275 protein does not contain a signal 
peptide or an initial methionine residue and begins from residue 2 of SEQ ID N0:8. 
In one embodiment, a TANGO 300 protein does not contain a signal peptide or an 

30 initial methionine residue and begins from residue 2 of SEQ ID NO: 1 4 or SEQ ID 
NO: 17. Thus, in one embodiment, a MANGO 245 protein does not contain a signal 
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peptide or an initial methionine residue an begins from residue 2 of SEQ ID NO:20 or 
SEQIDNO:23. 

In some embodiments of the invention, the domains and the mature protein 
resulting from the cleavage of such signal peptides are also included herein. For 
5 example, the cleavage of a signal peptide consisting of amino acids 1 to 26 of SEQ ID 
N0:2 (SEQ ID NO:26) results in a mature TANGO 244 protein corresponding to 
amino acids 27-162 of SEQ ID N0:2 (SEQ ID NO:27). The signal peptide is 
normally cleaved during possessing of the mature protein. 

In another example, a TANGO 244, TANGO 246 or MANGO 245 family 
10 member also includes one or more of the following domains: (1) an extracellular 
domain; (2) a transmembrane domain; and (3) a cytoplasmic domain as described 
herein. 

TANGO 244 family members can also include an immunoglobulin 
domain. Immunoglobulin domains are present in a variety of proteins and are 

1 5 involved in protein-protein and protein-ligand interaction at the cell surface. A 
consensus hidden Markov model immunoglobulin domain has the sequence of SEQ 
ID NO:29. This consensus sequence is shown in Figure 3 where the more conserved 
residues in the consensus sequence are indicated by uppercase letters and the less 
conserved residues in the consensus sequence are indicated by lowercase letters. 

20 Human TANGO 244 includes a immunoglobulin domain at amino acids 37 to 97 of 
SEQ ID N0:2 (SEQ ID NO:28). 

TANGO 246 family members can also include a cell cycle protein domain. 
A consensus hidden Markov model cell cycle protein domain has the sequence of 
SEQ ID NO: 3 1 . This consensus sequence is shown in Figure 7 where the more 

25 conserved residues in the consensus sequence are indicated by uppercase letters and 
the less conserved residues in the consensus sequence are indicated by lowercase 
letters. Human TANGO 246 includes a cell cycle protein domain at amino acids 27 to 
215 of SEQ ID N0:5 (SEQ ID NO:30). Among the proteins which have a cell cycle 
protein domain are CDC3, CDCIO, and CDCl 1, all of which are important for 

30 regulation of the cell cycle. Many proteins which include this domain are GTP 
binding proteins. 
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In addition, TANGO 246 family members can also include an ABC 
transporter domain. A consensus hidden Markov model ABC transporter protein 
domain has the sequence of SEQ ID NO:33, This consensus sequence is shown in 
Figure 8 where the more conserved residues in the consensus sequence are indicated 
5 by uppercase letters and the less conserved residues in the consensus sequence are 
indicated by lowercase letters. The ABC transporter protein domain of TANGO 246 
is located at amino acids 30 to 192 of SEQ ID N0:5 (SEQ ID NO:32). A number of 
proteins having an ABC transporter protein domain act as active transporters of small 
hydrophilic molecules {e.g., ions) across cell membranes, including intracellular 

10 membranes. In eukaryotes, ABC transporter protein domains are present in multidrug 
resistance proteins. These protein are involved in extrusion of drugs from cells and 
play a key role in drug resistance. This domain is also present in cystic fibrosis 
transmembrane conductance regulator (CFTR), a protein that likely acts as a chloride 
ion transporter. Many proteins having an ABC transporter domain are ATP binding 

15 proteins. 

TANGO 275 family members can include an EGF-like domain. A 
consensus hidden Markov model EGF-like domain has the sequence of SEQ ID 
NO:49. This consensus sequence is shown in Figures 1 1 A-1 IB where the more 
conserved residues in the consensus sequence are indicated by uppercase letters and 

20 the less conserved residues in the consensus sequence are indicated by lowercase 
letters. Human TANGO 275 includes EFG-like domains at amino acids 99 to 126 
(SEQ ID NO:34), 345 to 380 (SEQ ID NO:35), 564 to 600 (SEQ ID NO:36), 606 to 
644 (SEQ ID NO:37), 650 to 687 (SEQ ID NO:38), 693 to 728 (SEQ ID NO:39), 734 
to 769 (SEQ ID NO:40), 775 to 810 (SEQ ID N0:41), 816 to 850 (SEQ ID NO:42), 

25 856 to 893 (SEQ ID NO:43), 983 to 1020 (SEQ ID NO:44), 1026 to 1061 (SEQ ID 
NO:45), 1072 to 1 107 (SEQ ID NO:46), 1203 to 1238 (SEQ ID NO:47), and 1244 to 
1283 (SEQ ID NO:48), One or more EGF-like domains (e.g., 1, 2, 4, 8, 13, 17, or 44 
copies) are found in the extracellular domain of a wide range of proteins of 
transmembrane and wholly secreted proteins having diverse function. The consensus 

30 EGF-like domain sequence includes six cysteines, all of which are thought to be 
involved in disulfide bonds. 
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TANGO 275 family members can include a transforming growth factor P 
binding protein-like domains (TB domains). A consensus hidden Markov model TB 
domain has the amino acid sequence of SEQ ID NO:54. This consensus sequence is 
shown in Figure 12 where the more conserved residues in the consensus sequence are 
5 indicated by uppercase letters and the less conserved residues in the consensus 
sequence are indicated by lowercase letters. Human TANGO 275 includes TB 
domains at amino acids 273 to 316 {SEQ ID NO:50), 399 to 440 (SEQ ID N0:51), 
913 to 956 (SEQ ID NO:52), and 1 132 to 11 77 (SEQ ID NO:53) of SEQ ID NO:8. 
A TB domain is found in matrix fibrils (Yuan et al, 1997, EMBO 1 16:6659-66). 

10 TANGO 275 family members can include a metallothionein domain. A 

consensus hidden Markov model metallothionein domain has the amino acid sequence 
of SEQ ID NO:56. This consensus sequence is shown in Figure 1 3 where the more 
conserved residues in the consensus sequence are indicated by uppercase letters and 
the less conserved residues in the consensus sequence are indicated by lowercase 

15 letters. Human TANGO 275 includes a metallothionein domain at amino acids 794 to 
708 (SEQ ID NO:55) of SEQ ID NO:8. Metallothionein domains are found in 
proteins which bind heavy metals (e.g., copper, zinc, cadmium, and nickel) through 
thiolate bonds. 

MANGO 245 family members can also include a Clq domain. A 
20 consensus hidden Markov model Clq domain has the amino acid sequence of SEQ ID 
NO:72. This consensus sequence is shown in Figure 27 where the more conserved 
residues in the consensus sequence are indicated by uppercase letters and the less 
conserved residues in the consensus sequence are indicated by lowercase letters. 
Human MANGO 245 includes Clq domains at amino acids 31 to 156 of SEQ ID 
25 NO:20 (SEQ ID NO:70) and amino acids 178 to 294 of SEQ ID NO:20 (SEQ ID 
NO: 71). Monkey MANGO 245 includes Clq domains at amino acids 31 to 156 of 
SEQ ID NO:23 (SEQ ID NO:73) and amino acids 178 to 31 1 of SEQ ID NO: (SEQ 
ID NO:74). Murine MANGO 245 includes a Clq domain at amino acids 30 to 155 of 
SEQ ID NO: 91 (SEQ ID NO:93), Clq domains are found in wholly secreted or 
30 membrane bound proteins that are short-chain collagens and collagen-like molecules. 
The domain likely forms ten P-strands interspersed by P-tums and/or loops. 
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Various features of TANGO 244, TANGO 246, TANGO 275, TANGO 
300, and MANGO 245 are summarized below. 

TANGO 244 

5 A cDNA encoding TANGO 244 was identified by analyzing the sequences 

of clones present in a human fetal lung cDNA library. 

This analysis led to the identification of a clone, Athua62r9, encoding full- 
length human TANGO 244. The cDNA of this clone is 1513 nucleotides long (Figure 
1; SEQ ID N0:1). The 486 nucleotide open reading frame of this cDNA, nucleotide 
10 85 to nucleotide 570 of SEQ ID N0:1 (SEQ ID N0:3), encodes a 162 amino acid 
protein (Figure 1 ; SEQ ID NO:2). 

The signal peptide prediction program SIGNAL? (Nielsen et al., 1997, 
Protein Engineering 10:1-6) predicted that human TANGO 244 includes a 26 amino 
acid signal peptide (amino acid 1 to about amino acid 26 of SEQ ID N0:2; SEQ ID 
15 NO:26) preceding the mature human TANGO 244 protein (corresponding to about 
amino acid 27 to amino acid 162 of SEQ ID N0:2; SEQ ID NO:27). 

Human TANGO 244 is a transmembrane protein having an extracellular 
domain which extends from about amino acid 27 to about amino acid 119 of SEQ ID 
N0:2 (SEQ ID NO:75), a transmembrane domain which extends from about amino 
20 acid 120 to about amino acid 142 of SEQ ID N0:2 (SEQ ID NO:76), and a 

cytoplasmic domain which extends from about amino acid 143 to amino acid 162 of 
SEQ ID N0:2 (SEQ ID NO:77). 

Alternatively, in another embodiment, a human TANGO 244 protein 
contains an extracellular domain at amino acid residues 143 to 162 of SEQ ID N0:2 
25 (SEQ ID NO:77), transmembrane domains at amino acid residues 120 to 142 of SEQ 
ID N0:2 (SEQ ID NO:76), and a cytoplasmic domain at amino acid residues 27 to 
1 19 of SEQ ID N0:2 (SEQ ID NO:75). 

Human TANGO 244 that has not been post-translationally modified is 
predicted to have a molecular weight of 16,8 kDa prior to cleavage of its signal 
30 peptide and a molecular weight of 14.2 kDa subsequent to cleavage of its signal 
peptide. 
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Human TANGO 244 includes an immunoglobulin domain at amino acids 
37 to 97 of SEQ ID N0:2 (SEQ ED NO:28). Figure 3 depicts an alignment of the 
immunoglobulin domain of human TANGO 244 with a consensus hidden Markov 
model immunoglobulin domain derived from a (SEQ ID NO:29). 
5 Within human TANGO 244, an N-glycosylation site is present at amino 

acids 84 to 87 of SEQ ID N0:2. A protein kinase C phosphorylation sites is present 
at amino acids 92 to 94 of SEQ ID N0:2. N-myristylation sites are present at amino 
acids 11 to 16, 37 to 42, 91 to 96, 102 to 107, and 122 to 127 of SEQ ID NO:2. An 
amidation site is present at amino acids 148 to 151 of SEQ ID N0:2. 

1 0 Clone Athua62f9, v^hich encodes human TANGO 244, was deposited as 

EpT244 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 and assigned Accession 
Number 207223. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 

15 Purposes of Patent Procedure. This deposit was made merely as a convenience for 
those of skill in the art and is not an admission that a deposit is required under 35 
U.S.C. §112. 

Figure 2 depicts a hydropathy plot of human TANGO 244. Relatively 
hydrophobic regions are above the horizontal line, and relatively hydrophilic regions 
20 are below the horizontal line. The cysteine residues (cys) are indicated by short 
vertical lines just below the hydropathy trace. The hydropathy plot of Figure 2 
indicates that human TANGO 244 has a signal peptide at its amino terminus and an 
internal hydrophobic region, suggesting that TANGO 244 is a transmembrane protein. 
Northern blot analysis of human TANGO 244 expression revealed that 
25 human TANGO 244 is expressed in the colon, kidney, liver, and lung. 

Himian TANGO 244 has sequence homology to human CTH (Marcuz et 
al., 1998, EuK J. Immunol 28:4094-4104; Genbank Accession Number AFO61022). 
Figure 4 depicts an alignment of the amino acid sequence of human TANGO 244 
(SEQ ID N0:2) and the amino acid sequence of human CTH (SEQ ID NO:81). In 
30 this alignment, the sequences are 48.6% identical overall. However, there is a 
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substantial region of complete identity. TANGO 244 may act as a immunoglobulin 
superfamily-type receptor. 

Use of TANGO 244 Nucleic Acids. Polypeptides, and Modulators Thereof 
5 TANGO 244 polypeptides, nucleic acids, and modulators thereof can be 

used to modulate the function, morphology, proliferation and/or differentiation of 
cells in the tissues in which they are expressed. Such molecules can be used to treat 
disorders associated with abnormal or aberrant metabolism or function of cells in the 
tissues in which they are expressed. Tissues in which TANGO 244 is expressed 

10 include, for example, the colon, kidney, liver, and lung. Such disorders include but 
are limited to lymphoma, leukemia, amyloidosis, scleroderma, mastocytosis. 

In one example, TANGO 244 polypeptides, nucleic acids, or modulators 
thereof can be used to treat colonic disorders, such as congenital anomalies {e.g., 
megacolon and imperforate anus), idiopathic disorders (eg., diverticular disease and 

15 melanosis coli), vascular lesions {e.g., ischemic colistis, hemorrhoids, 
angiodysplasia), inflammatory diseases (e.g., idiopathic ulcerative colitis, 
pseudomembranous colitis, and lymphopathia venereum), tumors (e.g., hyperplastic 
polyps, adenomatous polyps, bronchogenic cancer, colonic carcinoma, squamous cell 
carcinoma, adenoacanthomas, sarcomas, lymphomas, argentaffinomas, carcinoids, 

20 and melanocarcinomas) and Crohn's Disease. 

In another example, TANGO 244 polypeptides, nucleic acids, or 
modulators thereof can be used to treat renal disorders, such as glomerular diseases 
{e.g., acute and chronic glomerulonephritis, rapidly progressive glomerulonephritis, 
nephrotic syndrome, focal proliferative glomerulonephritis, glomerular lesions 

25 associated with systemic disease, such as systemic lupus erythematosus, 

Goodpasture's syndrome, multiple myeloma, diabetes, neoplasia, sickle cell disease, 
and chronic inflammatory diseases), tubular diseases (e.g., acute tubular necrosis and 
acute renal failure, polycystic renal disease, medullary sponge kidney, medullary 
cystic disease, nephrogenic diabetes, and renal tubular acidosis), tubulointerstitial 

30 diseases {e.g., pyelonephritis, drug and toxin induced tubulointerstitial nephritis, 
hypercalcemic nephropathy, and hypokalemic nephropathy) acute and rapidly 
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progressive renal failure, chronic renal failure, nephrolithiasis, gout, vascular diseases 
(e.g., hypertension and nephrosclerosis, microangiopathic hemolytic anemia, 
atheroembolic renal disease, diffuse cortical necrosis, and renal infarcts), or tumors 
(e.g., renal cell carcinoma and nephroblastoma). 
5 In another example, TANGO 244 polypeptides, nucleic acids, or 

modulators thereof can be used to treat hepatic (liver) disorders, such as jaundice, 
hepatic failure, hereditary hyperbiliruinemias {e.g., Gilbert*s syndrome, Crigler-Naijar 
syndromes and Dubin- Johnson and Rotor's syndromes), hepatic circulatory disorders 
{e.g., hepatic vein thrombosis and portal vein obstruction and thrombosis) hepatitis 

10 {e.g., chronic active hepatitis, acute viral hepatitis, and toxic and drug-induced 

hepatitis) cirrhosis {e.g., alcoholic cirrhosis, biliary cirrhosis, and hemochromatosis), 
or malignant tumors (eg., primary carcinoma, hepatoma, hepatoblastoma, liver cysts 
and angiosarcoma). 

In another example, TANGO 244 polypeptides, nucleic acids, or 

15 modulators thereof can be used to treat pulmonary (lung) disorders, such as 

atelectasis, cystic fibrosis, rheumatoid lung disease, pulmonary congestion or edema, 
chronic obstructive airway disease (e.g., emphysema, chronic bronchitis, bronchial 
asthma, and bronchiectasis), diffuse interstitial diseases (e.g., sarcoidosis, 
pneumoconiosis, hypersensitivity pneumonitis, bronchioHtis Goodpasture's 

20 syndrome, idiopathic pulmonary hemosiderosis, idiopathic pulmonary fibrosis, 
pulmonary alveolar proteinosis, desquamative interstitial pneumonitis, chronic 
interstitial pneumonia, fibrosing alveolitis, hamman-rich syndrome, pulmonary 
eosinophilia, diffuse interstitial fibrosis, Wegener's granulomatosis, lymphomatoid 
granulomatosis, and lipid pneumonia), or tumors (e.g., bronchogenic carcinoma, 

25 bronchioiovlveolar carcinoma, bronchial carcinoid, hamartoma, and mesenchymal 
tumors). 

Because TANGO 244 includes immunoglobulin domains and has 
homology to human CTH, TANGO 244 polypeptides, nucleic acids, and modulators 
thereof can be used to treat disorders involving an immune, allergic or autoimmune 
30 response (e.g., arthritis, multiple sclerosis, meningitis, encephalitis, atherosclerosis, or 
infection). 
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Further, in light of TANGO 244's pattern of expression in humans, 
TANGO 244 expression can be utiHzed as a marker for specific tissues (e.g., tissues 
of the colon, kidney, Uver, or lung) and/or cells (e.g., colon, renal, hepatic, or 
pulmonary) in which TANGO 244 is expressed, TANGO 244 nucleic acids can also 
5 be utilized for chromosomal mapping. 

TANGO 246 

A cDNA encoding human TANGO 246 was identified by analyzing the 
sequences of clones present in a human fetal spleen cDNA library. 

10 This analysis led to the identification of a clone, Athsa34d2, encoding full- 

length human TANGO 246. The cDNA of this clone is 1992 nucleotides long 
(Figures 5A-5B; SEQ ID N0:4). The 987 nucleotide open reading frame of this 
cDNA, nucleotide 94 to nucleotide 1080 of SEQ ID N0:4 (SEQ ID N0:6), encodes a 
329 amino acid protein (Figures 5A-5B; SEQ ID N0:5). 

15 Human TANGO 246 has a hydrophobic domain which extends from about 

amino acid 306 to about amino acid 323 of SEQ ID N0:5 (SEQ ID NO:58). This 
could represent a transmembrane domain or an internal signal peptide. This domain 
follows a domain which extends fi*om about amino acid 1 to about amino acid 305 of 
SEQ ID N0:5 (SEQ ID NO:57) and is followed by a domain which extends from 

20 about amino acid 324 to amino acid 329 of SEQ ID N0:5 (SEQ ID NO:59). 

Human TANGO 246 includes a cell cycle protein domain at amino acids 
27 to 215 of SEQ ID N0:5 (SEQ ID NO:30). Figure 7 depicts an alignment of the 
cell cycle protein domain of human TANGO 246 with a consensus hidden Markov 
model cell cycle protein domain (SEQ ID N0:31). 

25 Human TANGO 246 includes an ABC transporter domain at amino acids 

30 to 192 of SEQ ID N0:5 (SEQ ID NO:32). Figure 8 depicts an ahgnment of the 
ABC transporter domain of human TANGO 246 with a consensus hidden Markov 
model ABC transporter domain (SEQ ID NO:33). 

Human TANGO 246 that has not been post-translationally modified is 

30 predicted to have a molecular weight of 37.5 kDa. 
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Within human TANGO 246, a cAMP and cGMP-dependent protein kinase 
phosphorylation site is present at amino acids 71 to 74 of SEQ ID N0:5. Protein 
kinase C phosphorylation sites are present at amino acids 66 to 68, 75 to 77, 99 to 
101, 134 to 136, 154 to 156, and 222 to 224 of SEQ ID N0:5. Casein kinase II 
5 phosphorylation sites are present at amino acids 75 to 78, 99 to 102, 127 to 130, 154 
to 157, 194 to 197, and 299 to 302 of SEQ ID N0:5. A tyrosine kinase 
phosphorylation site is present at amino acids 214 to 221 of SEQ ID N0:5. 
N-myristylation sites are present at amino acids 40 to 45, 88 to 93, and 219 to 224 of 
SEQ ID N0:5. An ATP/GTP-binding site motif A is present at amino acids 37 to 44 
10 of SEQ ID NO: 5. An amidation site is present at amino acids 5 1 to 54 of SEQ ID 
NO:5. 

Clone Athsa34d2, which encodes human TANGO 246, was deposited as 
EpT246 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 and assigned Accession 
15 Number 207223. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure. This deposit was made merely as a convenience for 
those of skill in the art and is not an admission that a deposit is required under 35 
U.S.C. §112. 

20 Figure 6 depicts a hydropathy plot of human TANGO 246. Relatively 

hydrophobic regions are above the horizontal line, and relatively hydrophilic regions 
are below the horizontal line. The cysteine residues (cys) are indicated by short 
vertical lines just below the hydropathy trace. The hydropathy plot of Figure 5 
indicates the presence of a hydrophobic domain within human TANGO 246, 

25 suggesting that human TANGO 246 is either a transmembrane protein or a secreted 
protein which employs an internal signal peptide. 

Human TANGO 246 has homology to Arabidopsis thaliana AIGl, a gene 
which is involved in resistance response (Genbank Accession Number AAC49289: 
Reuber and Ausubel, 1996, Plant Cell 8:241-249), and Nicotiana tabacum NTGP4 

30 (Genbank Accession Number AAD095 1 8). Figure 3 1 depicts an alignment of the 
amino acid sequence of human TANGO 246 (SEQ ID N0:5) and the amino acid 
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sequence of Arabidopsis thaliana AIGl (Genbank Accession Number AAC49289 
(SEQ ID NO:87). In this alignment, the proteins are 3 1 2% identical. 

Use of TANGO 246 Nucleic Acids, Polypeptides, and Modulators Thereof 
5 TANGO 246 polypeptides, nucleic acids, and modulators thereof can be 

used to modulate the function, morphology, proliferation and/or differentiation of 
cells in the tissues in which they are expressed. 

TANGO 246 includes an ABC transporter domain. Proteins having such a 
domain are involved in disorders of transport of small molecules across cell 

10 membranes. Proteins having an ABC transporter domain are known to be involved in 
cystic fibrosis, hyperinsulinemia, adrenoleukodystrophy, familial intrahepatic 
cholestasis, sideroblatic anemia and ataxia, Stargardt disease, multidrug resistance, 
and hyperbilirubinemia II/Dubin- Johnson syndrome. Thus, TANGO 246 
polypeptides, nucleic acids, and modulators thereof can be used to treat these and 

1 5 other disorders. 

TANGO 246 includes a cell cycle protein domain. Proteins having such a 
domain are involved in regulation of the cell cycle. Thus, TANGO 246 polypeptides, 
nucleic acids, and modulators thereof can be used to treat disorders such as 
Alzheimer's disease, vascular restinosis, polycystic kidney disease, transplant 

20 rejection, chronic liver disease, and cancer. 

Further, in light of TANGO 246's presence in a human fetal spleen cDNA 
library, TANGO 246 expression can be utilized as a marker for specific tissues {e.g., 
lymphoid tissues such as the thymus and spleen) and/or cells {e.g., lymphocytes and 
splenic) in which TANGO 246 is expressed. TANGO 246 nucleic acids can also be 

25 utilized for chromosomal mapping. 

TANGO 275 

A cDNA encoding human TANGO 275 was identified by analyzing the 
sequences of clones present in a human pituitary gland cDNA library. 
30 This analysis led to the identification of a clone, Athbbl9dl, encoding 

fiiU-Iength human TANGO 275. The cDNA of this clone is 4225 nucleotides long 
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(Figures 9A-9D; SEQ ID NO:7). The 3867 nucleotide open reading frame of this 
cDNA, nucleotide 65 to nucleotide 3931 of SEQ ID N0:7 (SEQ ID N0:9), encodes a 
1289 amino acid protein (Figures 9A-9D; SEQ ID N0:8). 

The signal peptide prediction program SIGNAL? (Nielsen et aL, 1997, 
5 Protein Engineering 10:1-6) predicted that human TANGO 275 includes a 29 amino 
acid signal peptide (amino acid 1 to about amino acid 29 of SEQ ID N0:8; SEQ ID 
NO:60) preceding the mature human TANGO 275 protein (corresponding to about 
amino acid 30 to amino acid 1289 of SEQ ID N0:8; SEQ ID NO:61). 

Human TANGO 275 that has not been post-translationally modified is 
10 predicted to have a molecular weight of 137.9 kDa prior to cleavage of its signal 
peptide and a molecular weight of 135.3 kDa subsequent to cleavage of its signal 
peptide. 

Human TANGO 275 includes EFG-like domains at amino acids 99 to 126 
(SEQ ID NO:34), 345 to 380 (SEQ ID NO:35), 564 to 600 (SEQ ID NO:36), 606 to 

15 644 (SEQ ID NO:37), 650 to 687 (SEQ ID NO:38), 693 to 728 (SEQ ID NO:39), 734 
to 769 (SEQ ID NO:40), 775 to 810 (SEQ ID N0:41), 816 to 850 (SEQ ID NO:42), 
856 to 893 (SEQ ID NO:43), 983 to 1020 (SEQ ID NO:44), 1026 to 1061 (SEQ ID 
NO:45), 1072 to 1 107 (SEQ ID NO:46), 1203 to 1238 (SEQ ID NO:47), and 1244 to 
1283 (SEQ ID NO:48). An ahgnment of each of the EGF-like domains of human 

20 TANGO 275 with a consensus hidden Markov model EGF-like domain (SEQ ID 
NO:49) is shown in Figures 1 1 A-1 IB. 

Human TANGO 275 includes transforming growth factor p binding 
protein like domains (TB domains) at amino acids 273 to 316 (SEQ ID NO:50), 399 
to 440 (SEQ ID N0;51), 913 to 956 (SEQ ID NO:52), and 1 132 to 1177 (SEQ ID 

25 NO:53) of SEQ ID N0:8. An alignment of each of the TB domains of human 

TANGO 275 with a consensus hidden Markov model TB domain (SEQ ID NO:54) is 
shown in Figure 12. 

Human TANGO 275 includes a metallothionein domain at amino acids 
694 to 708 (SEQ ID NO:55) of SEQ ID NO: 8. An alignment of the metallothionein 

30 domain of human TANGO 275 with a consensus hidden Markov model 
metallothionein domain (SEQ ID NO:56) is shown in Figure 13. 
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N-glycosylation sites are present at amino acids 75 to 78, 335 to 338, 831 
to 834, 922 to 925, and 1261 to 1264 of SEQ ID N0:8. 

Clone Athbbl9dl, which encodes human TANGO 275, was deposited as 
EpT275 with the American Type Culture Collection (ATCC® 10801 University 
5 Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 and assigned Accession 
Number 207220. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure. This deposit was made merely as a convenience for 
those of skill in the art and is not an admission that a deposit is required under 35 
10 U.S.C. §112, 

Figure 10 depicts a hydropathy plot of human TANGO 275. Relatively 
hydrophobic regions are above the horizontal line, and relatively hydrophilic regions 
are below the horizontal line. The cysteine residues (cys) are indicated by short 
vertical lines just below the hydropathy trace. The hydropathy plot of Figure 10 
1 5 indicates that human TANGO 275 has a signal peptide at its amino terminus, 
suggesting that human TANGO 275 is a secreted protein. 

Transcript analysis suggests that there are several splice variants of human 
TANGO 275. 

Human TANGO 275 appears to be the human homolog of a murine latent 
20 transforming growth factor-p binding protein 3 (LTBP-3; Yin et al., J. Biol. Chem. 
270:10147-60, 1995; Genbank Accession Number RL40459; PCT Application WO 
95/2261 1; GENSEQO Accession Number R79475). Figures 14A-14H depict an 
alignment of the nucleotide sequence of human TANGO 275 (SEQ ID N0:7) and the 
nucleotide sequence of murine LTBP-3 (Genbank Accession Number L40459; SEQ 
25 ID NO:82). This alignment was created using ALIGN (version 2.0; PAM120 scoring 
matrix; gap length penalty of 12; gap penalty of 4). In this alignment, the sequences 
are 77.1% identical. Figures 15A-15C depict an alignment of the amino acid 
sequence of human TANGO 275 (SEQ ID N0:8) and the amino acid sequence of 
murine LTBP-3 ( GENSEQO R79475; SEQ ID NO:83). This alignment was created 
30 using ALIGN (version 2.0; PAM120 scoring matrix; gap length penalty of 12; gap 
penalty of 4). In this alignment, the sequences are 82.8% identical. 
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Northern blot analysis of human TANGO 275 expression revealed that 
human TANGO 275 is expressed at a high level in the heart and at a moderate level in 
the brain, placenta, lung, liver, skeletal muscle, kidney and pancreas. 

A murine TANGO 275 cDNA was identified. The cDNA of this clone is 
5 4376 nucleotides long (Figures 16A-16G; SEQ ID NO:10). The 3759 nucleotide 
open reading frame of this cDNA, nucleotides of SEQ ID NO: 10 (SEQ ID NO: 12), 
encodes a 1253 amino acid protein (Figures 16A-16G; SEQ ID N0:1 1). Figures 32 A- 
32B depict an alignment of the amino acid sequence encoded by this murine TANGO 
275 cDNA clones (SEQ ID NO: 1 1) and the amino acid sequence of murine LTBP-3 
10 (GENSEQO Accession Number R79475; SEQ ID NO: 83). This alignment was 

created using ALIGN (version 2.0; PAM120 scoring matrix, gap length penalty of 12; 
gap penalty of 4). In this alignment, the sequences are 97.4% identical. 

Use of TANGO 275 Nucleic Acids. Polypeptides, and Modulators Thereof 

15 TANGO 275 polypeptides, nucleic acids, and modulators thereof can be 

used to modulate the function, morphology, proliferation and/or differentiation of 
cells in the tissues in which they are expressed. Such molecules can be used to treat 
disorders associated with abnormal or aberrant metabolism or function of cells in the 
tissues in which they are expressed. Tissues in which TANGO 275 is expressed 

20 include, for example, pancreas, adrenal medulla, adrenal cortex, kidney, thyroid, 
testis, stomach, heart, brain, liver, placenta, lung, skeletal muscle, or small intestine. 

As TANGO 275 exhibits expression in the heart, TANGO 275 
polypeptides, nucleic acids, or modulators thereof can be used to treat heart and 
cardiovascular disorders, such as ischemic heart disease (e.g., angina pectoris, 

25 myocardial infarction, and chronic ischemic heart disease), hypertensive heart disease, 
pulmonary heart disease, valvular heart disease (e.g., rheumatic fever and rheumatic 
heart disease, endocarditis, mitral valve prolapse, and aortic valve stenosis), 
congenital heart disease {e.g., valvular and vascular obstructive lesions, atrial or 
ventricular septal defect, and patent ductus arteriosus), or myocardial disease (e.g., 

30 myocarditis, congestive cardiomyopathy, and hypertrophic cardiomyopathy). 

Disorders of the vasculature that can be treated or prevented according to the methods 
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of the invention include atheroma, tumor angiogenesis, wound heahng, diabetic 
retinopathy, hemangioma, psoriasis, and restenosis, e.g., restenosis resulting from 
balloon angioplasty. 

In another example, TANGO 275 polypeptides, nucleic acids, or 
5 modulators thereof can be used to treat disorders of the brain, such as cerebral edema, 
hydrocephalus, brain herniations, iatrogenic disease (due to, e.g., infection, toxins, or 
drugs), inflammations (e.g., bacterial and viral meningitis, encephalitis, and cerebral 
toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, ischemia, and infarction, 
intracranial hemorrhage and vascular malformations, and hypertensive 

10 encephalopathy), and tumors (e.g., neuroglial tumors, neuronal tumors, tumors of 
pineal cells, meningeal tumors, primary and secondary lymphomas, intracranial 
tumors, and medulloblastoma), and to treat injury or trauma to the brain (e.g., spinal 
cord injuries, infarction, infection, malignancy, exposure to toxic agents, nutritional 
deficiency, paraneoplastic syndromes), degenerative nerve diseases (including but not 

15 limited to Alzheimer's disease, Parkinson's disease, Huntington's Chorea, Gilles de la 
Tourette's syndrome, amyotrophic lateral sclerosis, progressive supra-nuclear palsy, 
and other dementias), and neuropsychiatric disorders (including schizophrenia, 
schizoaffective disorder, attention deficit disorder, dysthymic disorder, major 
depressive disorder, mania, obsessive-compulsive disorder, psychoactive substance 

20 use disorders, anxiety, panic disorder, as well as bipolar affective disorder, e.g., severe 
bipolar affective disorder, bipolar affective disorder with hypomania and major 
depression). 

In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat placental disorders, such as toxemia of 
25 pregnancy (e.g., preeclampsia and eclampsia), placentitis, or spontaneous abortion. 
In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat pulmonary (lung) disorders, such as 
atelectasis, cystic fibrosis, rheumatoid lung disease, pulmonary congestion or edema, 
chronic obstructive airway disease (e.g., emphysema, chronic bronchitis, bronchial 
30 asthma, and bronchiectasis), diffuse interstitial diseases (e.g., sarcoidosis, 

pneumoconiosis, hypersensitivity pneumonitis, Goodpasture's syndrome, idiopathic 
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pulmonary hemosiderosis, pulmonary alveolar proteinosis, desquamative interstitial 
pneumonitis, chronic interstitial pneumonia, fibrosing alveolitis, hamman-rich 
syndrome, pulmonary eosinophilia, diffuse interstitial fibrosis, Wegener's 
granulomatosis, lymphomatoid granulomatosis, and lipid pneumonia), or tumors {e.g., 
5 bronchogenic carcinoma, bronchiolovlveolar carcinoma, bronchial carcinoid, 
hamartoma, and mesenchymal tumors). 

In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat hepatic disorders, such as jaundice, hepatic 
failure, liver cysts, chronic liver disease, hereditary hyperbiliruinemias (e.g., Gilbert's 

10 syndrome, Crigler-Naijar syndromes and Dubin- Johnson and Rotor's syndromes), 
hepatic circulatory disorders (e.g., hepatic vein thrombosis and portal vein obstruction 
and thrombosis) hepatitis (e.g., chronic active hepatitis, acute viral hepatitis, and toxic 
and drug-induced hepatitis) cirrhosis (e.g., alcoholic cirrhosis, biliary cirrhosis, and 
hemochromatosis), or malignant tumors (e.g., primary carcinoma, hepatoblastoma, 

15 and angiosarcoma). 

In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat disorders of skeletal muscle, such as muscular 
dystrophy (e.g., Duchenne muscular dystrophy, Becker Muscular Dystrophy, Emery- 
Dreifuss muscular dystrophy, Limb-Girdle muscular dystrophy, Facioscapulohumeral 

20 muscular dystrophy, myotonic dystrophy, oculopharyngeal muscular dystrophy, distal 
muscular dystrophy, and congenital muscular dystrophy), motor neuron diseases (e.g., 
amyotrophic lateral sclerosis, infantile progressive spinal muscular atrophy, 
intermediate spinal muscular atrophy, spinal bulbar muscular atrophy, and adult spinal 
muscular atrophy), myopathies (e.g., inflammatory myopathies (e.g., dermatomyositis 

25 and polymyositis), myotonia congenita, paramyotonia congenita, central core disease, 
nemaline myopathy, myotubular myopathy, and periodic paralysis), and metabolic 
diseases of muscle (e.g., phosphorylase deficiency, acid maUase deficiency, 
phosphofructokinase deficiency, Debrancher enzyme deficiency, mitochondrial 
myopathy, carnitine deficiency, carnitine palmityl transferase deficiency, 

30 phosphoglycerate kinase deficiency, phosphoglycerate mutase deficiency, lactate 
dehydrogenase deficiency, and myoadenylate deaminase deficiency). 
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In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat renal disorders, such as glomerular diseases 
(e.g., acute and chronic glomerulonephritis, rapidly progressive glomerulonephritis, 
nephrotic syndrome, focal proliferative glomerulonephritis, glomerular lesions 
5 associated with systemic disease, such as systemic lupus erythematosus, 

Goodpasture's syndrome, multiple myeloma, diabetes, neoplasia, sickle cell disease, 
and chronic inflammatory diseases), tubular diseases {e.g., acute tubular necrosis and 
acute renal failure, polycystic renal diseasemedullary sponge kidney, medullary cystic 
disease, nephrogenic diabetes, and renal tubular acidosis), tubulointerstitial diseases 

10 (e.g., pyelonephritis, drug and toxin induced tubulointerstitial nephritis, 

hypercalcemic nephropathy, and hypokalemic nephropathy) acute and rapidly 
progressive renal failure, chronic renal failure, nephrolithiasis, vascular diseases (e.g., 
hypertension and nephrosclerosis, microangiopathic hemolytic anemia, atheroembolic 
renal disease, diffuse cortical necrosis, and renal infarcts), or tumors (e.g., renal cell 

1 5 carcinoma and nephroblastoma) . 

In another example, TANGO 275 polypeptides, nucleic acids, or 
modulators thereof can be used to treat pancreatic disorders, such as pancreatitis (e.g., 
acute hemorrhagic pancreatitis and chronic pancreatitis), pancreatic cysts (e.g., 
congenital cysts, pseudocysts, and benign or malignant neoplastic cysts), pancreatic 

20 tumors (e.g., pancreatic carcinoma and adenoma), diabetes mellitus (e.g., insulin- and 
non-insulin-dependent types, impaired glucose tolerance, and gestational diabetes), or 
islet cell tumors (e.g., insulinomas, adenomas, Zollinger-Ellison syndrome, 
glucagonomas, and somatostatinoma). 

TANGO 275 includes an EGF-like domain. Proteins having such domains 

25 play a role in a wide variety of biological processes, including cholesterol uptake, 
blood coagulation, and specification of cell fate. Thus, TANGO 275 polypeptides, 
nucleic acids, and modulators thereof can be used modulate these processes. TANGO 
275 polypeptides, nucleic acids, and modulators thereof can be used to modulate cell 
proliferation, morphogenesis, tissue repair and renewal, terminal differentiation, cell 

30 survival, and cell migration. They can be used to treat cancer, promote wound healing 
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(e.g., of the slcin, cornea, or mucosa), and modulate an allergic or inflammatory 
response. 

TANGO 275 includes a TB domain. Proteins having this domain are 
commonly associated with extracellular matrix fibrils. TANGO 275 polypeptides, 
5 nucleic acids, and modulators thereof can be used to modulate matrix formation and 
degradation and to treat disorders of the connective tissue, e.g., Marfan syndrome. 

As a transforming growth factor-P binding protein, TANGO 275 can 
interact with transforming growth factor-P (TGF-P). In general, transforming growth 
factor-P binding proteins (LTBP) bind to TGF-P to form latent growth factor 

10 complexes (large latent complexes). LTBP are important regulators of TGF-p 
activity. LTBP are thought to facilitate the normal assembly and secretion of large 
latent complexes, target latent TGF-P to certain connective tissues, modulate the 
activity of large latent complexes, and target latent TGF-P to the cell surface. Given 
that TANGO 275 can modulate TGF-P activity, TANGO 275 polypeptides, nucleic 

15 acids, and modulators of TANGO 275 expression or activity can be used to treat 
connective tissue and bone disorders such as bone fracture, osteoporosis, and 
osteogenesis imperfecta. In addition, such compounds can be used to promote bone 
repair, promote bone regeneration, and improve bone implant bonding. Thus, 
TANGO 275 polypeptides, nucleic acids, and modulators thereof can be used to 

20 modulate various aspects of bone repair and regeneration, including, e.g., clot 

formation, clot dissolution, removal of damaged tissue, growth of granulation tissue, 
cartilage growth and turnover, formation of callus tissue, remodeling, formation of 
trabecular bone, and formation of cortical bone. 

Further, in light of TANGO 275's pattern of expression in humans, 

25 TANGO 275 expression can be utilized as a marker for specific tissues (e.g., 

cardiovascular tissue such as the heart) and/or cells (e.g., cardiac) in which TANGO 
275 is expressed. TANGO 275 nucleic acids can also be utilized for chromosomal 
mapping. 
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TANGO 300 

A cDNA encoding human TANGO 300 was identified by analyzing the 
sequences of clones present in a human fetal lung cDNA Hbrary. 

This analysis led to the identification of a sequence encoding full-length 
5 human TANGO 300. The cDNA of this clone is 1332 nucleotides long (Figure 17A- 
17B; SEQ ID N0:13). The 1083 nucleotide open reading frame of this cDNA, 
nucleotide 31 to nucleotide 1113 of SEQ ID NO: 13 (SEQ ID NO: 15), encodes a 361 
amino acid protein (Figure 17A-17B; SEQ ID NO: 14). 

The signal peptide prediction program SIGNAL? (Nielsen et al., 1997, 

10 Protein Engineering 10:1-6) predicted that human TANGO 300 includes a 20 amino 
acid signal peptide (amino acid 1 to about amino acid 20 of SEQ ID NO: 14; SEQ ID 
NO:62) preceding the mature human TANGO 300 protein (corresponding to about 
amino acid 21 to amino acid 361 of SEQ ID N0:14; SEQ ID NO:63). 

Human TANGO 300 is a transmembrane protein having an extracellular 

15 domain which extends from about amino acid 21 to about amino acid 304 of SEQ ID 
NO: 14 (SEQ ED NO:85), a transmembrane domain which extends from about amino 
acid 305 to about amino acid 321 of SEQ ID NO: 14 (SEQ ID NO:86), and a 
cytoplasmic domain which extends from about amino acid 322 to amino acid 361 of 
SEQ ID NO: 1 4 (SEQ ID NO:87). 

20 Alternatively, in another embodiment, a human TANGO 300 protein 

contains an extracellular domain at amino acid residues 322 to amino acid 361 of SEQ 
ID NO: 14 (SEQ ID NO:87), transmembrane domains at amino acid residues 305 to 
about amino acid 321 of SEQ ID NO: 14 (SEQ ID NO: 86), and a cytoplasmic domain 
at amino acid 21 to about amino acid 304 of SEQ ID NO: 14 (SEQ ID NO:85). 

25 Human TANGO 300 that has not been post-translationally modified is 

predicted to have a molecular weight of 40.6 kDa prior to cleavage of its signal 
peptide and a molecular weight of 38.5 kDa subsequent to cleavage of its signal 
peptide. 

Within human TANGO 300, protein kinase C phosphorylation sites are 
30 present at amino acids 74 to 76, 89 to 91, 307 to 309, and 359 to 361 of SEQ ID 

NO: 14. Casein kinase II phosphorylation sites are present at amino acids 34 to 37, 41 
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to 44, 74 to 77, 153 to 156, and 169 to 172 of SEQ ID N0:14. Tyrosine kinase 
phosphorylation sites are present at amino acids 1 11 to 1 17 and 236 to 243 of SEQ ID 
NO: 14. N-myristylation sites are present at amino acids 25 to 30 and 170 to 175 of 
SEQ ID NO: 14. 

5 Clone AthX672i5, which encodes human TANGO 300, was deposited as 

EpT300 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2236) on June 30, 1999 and assigned Accession 
Number PTA-293. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 
10 Purposes of Patent Procedure. This deposit was made merely as a convenience for 
those of skill in the art and is not an admission that a deposit is required under 35 
U.S.C. §112. 

Figure 18 depicts a hydropathy plot of human TANGO 300, Relatively 
hydrophobic regions are above the horizontal line, and relatively hydrophilic regions 
15 are below the horizontal line. The cysteine residues (cys) are indicated by short 
vertical lines just below the hydropathy trace. The hydropathy plot of Figure 1 8 
indicates that human TANGO 300 has a signal peptide at its amino terminus and an 
internal hydrophobic region, suggesting that human TANGO 300 is a transmembrane 
protein. 

20 A clone, jthub009c07, containing murine TANGO 300 was also identified. 

The cDNA of this clone is 1400 nucleotides long (Figures 19A-19C; SEQ ID NO: 16). 
The 1 155 nucleotide open reading frame of this cDNA, nucleotide 41 to nucleotide 
1 195 of SEQ ID NO: 16 (SEQ ID NO: 18), encodes a 385 amino acid protein (Figures 
19A-19C;SEQ ID N0:17). 

25 The signal peptide prediction program SIGNALP (Nielsen et al., 1997, 

Protein Engineering 10:1-6) predicted that murine TANGO 300 includes a 19 amino 
acid signal peptide (amino acid 1 to about amino acid 19 of SEQ ID NO: 16; SEQ ID 
NO:64) preceding the mature murine TANGO 300 protein (corresponding to about 
amino acid 20 to amino acid 385 of SEQ ID NO: 16; SEQ ID NO:65). 

30 Murine TANGO 300 is a transmembrane protein having an extracellular 

domain which extends from about amino acid 20 to about amino acid 318 of SEQ ID 
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NO: 16 (SEQ ID NO: 88), a transmembrane domain which extends from about amino 
acid 319 to about amino acid 335 of SEQ ID N0:16 (SEQ ID NO:89), and a 
cytoplasmic domain which extends from about amino acid 336 to amino acid 385 of 
SEQ ID NO: 16 (SEQ ID NO:90). 
5 Alternatively, in another embodiment, a murine TANGO 300 protein 

contains an extracellular domain at amino acid residues 336 to amino acid 385 of SEQ 
ID N0:16 (SEQ ID NO:90), transmembrane domains at amino acid residues 319 to 
about amino acid 335 of SEQ ID NO: 16 (SEQ ID NO:89), and a cytoplasmic domain 
at amino acid 20 to about amino acid 318 of SEQ ID N0:16 (SEQ ID NO:88). 
10 Murine TANGO 300 that has not been post-translationally modified is 

predicted to have a molecular weight of 43.1 kDa prior to cleavage of its signal 
peptide and a molecular weight of 41 .0 kDa subsequent to cleavage of its signal 
peptide. 

Within murine TANGO 300, protein kinase C phosphorylation sites are 
15 present at amino acids 85 to 87 and 378 to 380 of SEQ ID N0:17. Casein kinase U 
phosphorylation sites are present at amino acids 22 to 25, 37 to 40, 149 to 152, 165 to 
168 and 287 to 290 of SEQ ID NO: 17. A tyrosine kinase phosphorylation site is 
present at amino acids 107 to 1 13 of SEQ ID NO: 17. N-myristylation sites are 
present at amino acids 29 to 34, 89 to 94, 166 to 171 and 207 to 212 of SEQ ID 
20 NO: 17. A N-glycosylation site is present at amino acids 136 to 139 of SEQ ID 
N0:17. 

Figure 20 depicts a hydropathy plot of murine TANGO 300. Relatively 
hydrophobic regions are above the horizontal line, and relatively hydrophiiic regions 
are below the horizontal line. The cysteine residues (cys) are indicated by short 
25 vertical lines just below the hydropathy trace. The hydropathy plot of Figure 20 
indicates that murine TANGO 300 has a signal peptide at its amino terminus and an 
internal hydrophobic region, suggesting that murine TANGO 300 is a transmembrane 
protein. 

Figures 21 A-21B depict an alignment of the ORF nucleotide sequence of 
30 human TANGO 300 (SEQ ID NO: 15) and the ORF nucleotide sequence of murine 
TANGO 300 (SEQ ID N0:1 8). This alignment was created using BESTFIT 
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(BLOSUM 62 scoring matrix; gap open penalty of 12; frame shift penalty of 5; gap 
extend penalty of 4). In this alignment, the sequences are 77.7% identical. Figure 22 
depicts an alignment of the amino acid sequence of human TANGO 300 (SEQ ID 
NO: 14) and the amino acid sequence of murine TANGO 300 (SEQ ID NO: 17), This 
5 alignment was created using BESTFIT (BLOSUM 62 scoring matrix; gap open 
penalty of 12; frame shift penalty of 5; gap extend penalty of 4). In this alignment, 
the sequences are 69.6% identical. The ftill length nucleotide sequences of human 
TANGO 300 and murine TANGO 300 display 75.8% identity. 

10 Use of TANGO 300 Nucleic Acids, Polypeptides, and Modulators Thereof 

TANGO 300 polypeptides, nucleic acids, and modulators thereof can be 
used to modulate the fiinction, morphology, proliferation and/or differentiation of 
cells in the tissues in which they are expressed. 

Further, in light of TANGO 300's presence in a fetal lung cDNA library, 
15 TANGO 300 expression can be utilized as a marker for specific tissues {e.g., lung) 
and/or cells (e.g., pulmonary) in which TANGO 300 is expressed. TANGO 300 
nucleic acids can also be utilized for chromosomal mapping. 

MANGO 245 

20 A cDNA encoding MANGO 245 was identified by analyzing the 

sequences of clones present in a human adult brain cDNA library. 

This analysis led to the identification of a clone, Alhbabl65e5, encoding 
fiiU-length human MANGO 245. The cDNA of this clone is 1356 nucleotides long 
(Figures 23A-23B; SEQ ID NO: 19). The 1044 nucleotide open reading frame of this 

25 cDNA, nucleotide 105 to nucleotide 1 148 of SEQ ID NO: 19 (SEQ ID N0:21), 
encodes a 348 amino acid protein (Figures 23A-23B; SEQ ID NO:20). 

The signal peptide prediction program SIGNAL? (Nielsen et al., 1997, 
Protein Engineering 10:1-6) predicted that human MANGO 245 includes a 16 amino 
acid signal peptide (amino acid 1 to about amino acid 16 of SEQ ID NO:20; SEQ ID 

30 NO:66) preceding the mature human MANGO 245 protein (corresponding to about 
amino acid 17 to amino acid 348 of SEQ ID NO:20; SEQ ID NO:67). 
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Human MANGO 245 is a transmembrane protein having an extracellular 
domain which extends from about amino acid 17 to about amino acid 141 of SEQ ID 
NO:20 (SEQ ID NO:78), a transmembrane domain which extends from about amino 
acid 142 to about amino acid 158 of SEQ ID NO:20 (SEQ ID NO:79), and a 
5 cytoplasmic domain which extends from about amino acid 159 to amino acid 348 of 
SEQ ID NO:20 (SEQ ID NO.SO). 

Alternatively, in another embodiment, a murine TANGO 300 protein 
contains an extracellular domain at amino acid residues 159 to amino acid 348 of SEQ 
ID NO: 20 (SEQ ID NO: 80), transmembrane domains at amino acid residues 142 to 
10 about amino acid 158 of SEQ ID NO:20 (SEQ ID NO:79), and a cytoplasmic domain 
at amino acid 17 to about amino acid 141 of SEQ ID NO:20 (SEQ ID NO;78). 

Human MANGO 245 that has not been post-translationally modified is 
predicted to have a molecular weight of 37.9 kDa prior to cleavage of its signal 
peptide and a molecular weight of 36.3 kDa subsequent to cleavage of its signal 
15 peptide. 

Human MANGO 245 includes Clq domains at amino acids 31 to 156 of 
SEQ ID NO:20 (SEQ ID NO:70) and amino acids 178 to 294 of SEQ ID NO:20 (SEQ 
ID N0:71). Figure 27 depicts alignments of the Clq domains of human MANGO 245 
with a consensus hidden Markov model Clq domain (SEQ ID NO:72). 

20 Within MANGO 245, protein kinase C phosphorylation sites are present at 

amino acids 244 to 246 and 264 to 266 of SEQ ID NO:20. Casein kinase II 
phosphorylation sites are present at amino acids 38 to 41 and 298 to 301 of SEQ ID 
NO:20. N-myristylation sites are present at amino acids 66 to 71, 1 13 to 118, 145 to 
150, 219 to 224, and 295 to 300 of SEQ ID NO:20. 

25 Clone Alhbabl65e5, which encodes human MANGO 245, was deposited 

as EpM245 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 and assigned Accession 
Number 207223. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 

30 Purposes of Patent Procedure. This deposit was made merely as a convenience for 
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those of skill in the art and is not an admission that a deposit is required under 35 
U.S.C. §112. 

Figure 24 depicts a hydropathy plot of human MANGO 245. Relatively 
hydrophobic regions are above the horizontal line, and relatively hydrophilic regions 
5 are below the horizontal line. The cysteine residues (cys) are indicated by short 
vertical lines just below the hydropathy trace. The hydropathy plot of Figure 24 
indicates that human MANGO 245 has a signal peptide at its amino terminus and an 
internal hydrophobic region, suggesting that human MANGO 245 is a transmembrane 
protein. 

10 Northern blot analysis of human MANGO 245 expression revealed that 

human MANGO 245 is expressed at a relatively high level in the cerebellum, frontal 
lobe, and putamen; at a moderate level in the cerebral cortex, the medulla, occipital 
lobe, and temporal lobe; and a relatively low level in the spinal cord. Additional 
Northern blot analysis revealed the human MANGO 245 is expressed in amygdala, 

15 caudate nucleus, hippocampus, brain, substantia nigra, and subthalamic nucleus. 

A cDNA encoding monkey MANGO 245 was identified by analyzing the 
sequences of clones present in a monkey cDNA library. 

This analysis led to the identification of a clone, Alkbd75hl, encoding 
full-length monkey MANGO 245. The cDNA of this clone is 1416 nucleotides long 

20 (Figures 25A-25B; SEQ ID NO:22). The 987 nucleotide open reading fi-ame of this 
cDNA, nucleotide 250 to nucleotide 1236 of SEQ ID NO:22 (SEQ ID NO:24), 
encodes a 329 amino acid protein (Figures 25A-25B; SEQ ID NO:23). 

The signal peptide prediction program SIGNAL? (Nielsen et aL, 1997, 
Protein Engineering 10:1-6) predicted that monkey MANGO 245 includes a 16 

25 amino acid signal peptide (amino acid 1 to about amino acid 16 of SEQ ID NO:23; 
SEQ ID NO:68) preceding the mature monkey MANGO 245 protein (corresponding 
to about amino acid 17 to amino acid 329 of SEQ ID NO:23; SEQ ID NO:69). 

Monkey MANGO 245 that has not been post-translationally modified is 
predicted to have a molecular weight of 35.2 kDa prior to cleavage of its signal 

30 peptide and a molecular weight of 33.6 kDa subsequent to cleavage of its signal 
peptide. 
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Monkey MANGO 245 includes Clq domains at amino acids 31 to 156 of 
SEQ ID NO:23 (SEQ ID NO:73) and amino acids 178 to 31 1 of SEQ ID NO:23 (SEQ 
ID NO:74). Figure 28 depicts alignments of the Clq domains of monkey MANGO 
245 with a consensus hidden Markov model Clq domain (SEQ ID NO:72). 
5 Figure 26 depicts an alignment of the amino acid sequence of human 

MANGO 245 (SEQ ID NO:20) and the amino acid sequence of monkey MANGO 
245 (SEQ ID NO:23). This alignment was created using ALIGN (version 2.0; 
PAM120 scoring matrix; gap length penalty of 12; gap penalty of 4). In this 
alignment, the sequences are 84.8% identical overall. 

10 Clone Alkbd75hl, which encodes monkey MANGO 245, was deposited as 

EpK245 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2209) on June 18, 1999 and assigned Accession 
Number PTA-248. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the 

15 Purposes of Patent Procedure. This deposit was made merely as a convenience for 
those of skill in the art and is not an admission that a deposit is required under 35 
U.S.C. §112. 

In addition, a murine MANGO 245 was identified. The cDNA of this 
clone is 625 nucleotides long (Figure 29; SEQ ID NO:25). The open reading frame of 
20 this cDNA is begins at nucleotide 29 of SEQ ID NO:25. Murine MANGO 245 

includes a Clq domain at amino acids 30 to 155 of SEQ ID NO: 91 (SEQ ID NO:93). 

Within murine MANGO 245, protein kinase C phosphorylation sites are 
present at amino acids 64 to 66 and 178 to 180 of SEQ ID N0:91 . N-myristylation 
sites are present at amino acids 1 12 to 117 and 144 to 149 of SEQ ID N0:91 . A 
25 casein kinase II phosphorylation site is present at amino acids 37 to 40 of SEQ ID 
N0:91. An N-glycosylation site is present at amino acids 88 to 91 of SEQ ID N0:91. 

Figures 30A-30B depict an alignment of 697 of the 1356 nucleotides of the 
human MANGO 245 sequence (nucleotide 5 1 to nucleotide 748 of SEQ ID NO: 19) 
with the nucleotide sequence of murine MANGO 245 (SEQ ID NO:25). This 
30 alignment was created using BESTFIT (BLOSUM 62 scoring matrix; gap open 
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penalty of 12; frame shift penalty of 5; gap extend penalty of 4). In this alignment, 
the sequences are 89.6% identical overall. 

Use of MANGO 245 Nucleic Acids, Polypeptides, and Modulators Thereof 
5 MANGO 245 polypeptides, nucleic acids, and modulators thereof can be 

used to modulate the ftmction, morphology, proliferation and/or differentiation of 
cells in the tissues in which they are expressed. MANGO 245 is expressed in the 
brain and central nervous system. Thus, MANGO 245 polypeptides, nucleic acids, 
and modulators thereof can be used to treat CNS disorders such as Alzheimer's 

10 disease, senile dementia, Huntington's disease, amyotrophic lateral sclerosis, and 
Parkinson's disease, as well as Gilles de la Tourette's syndrome, autonomic function 
disorders such as hypertension and sleep disorders, and neuropsychiatric disorders 
that include, but are not limited to schizophrenia, schizoaffective disorder, attention 
deficit disorder, dysthymic disorder, major depressive disorder, mania, obsessive- 

1 5 compulsive disorder, psychoactive substance use disorders, anxiety, panic disorder, as 
well as bipolar affective disorder, e.g., severe bipolar affective (mood) disorder (BP- 
I), bipolar affective (mood) disorder with hypomania and major depression (BP-II). 

MANGO 245 includes a Clq domain. Known proteins having this domain 
play a role complement activation and autoimmune disorders. The Clq domain is also 

20 found in collagens and collagen-like molecules. MANGO 245 polypeptides, nucleic 
acids, and modulators thereof can be used to treat disorders of collagen assembly and 
degradation. 

Further, in light of MANGO 245's pattern of expression in humans, 
MANGO 245 expression can be utilized as a marker for specific tissues {e.g., brain) 
25 and/or cells (e.g., cerebellum, frontal lobe, or putamen) in which MANGO 245 is 
expressed. MANGO 245 nucleic acids can also be utilized for chromosomal 
mapping. 
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Tables 1 and 2 below provide summaries of TANGO 244, TANGO 246, 
TANGO 275, TANGO 300 and MANGO 245 sequence and protein domain 
information. 



5 TABLE 1 : Summary of Sequence Information for TANGO 244, TANGO 246, 
TANGO 275, TANGO 300 and MANGO 245 





Gene 


CDNA 


ORF 


Polypeptide 


Figure 


ATCC® 

Accession 

Number 


10 


TANGO 244 

nunidii 


SEQ ID N0:1 


SEQ ID N0:3 


SEQ ID NO:2 


Fig. 1 


207223 




TANGO 246 
Human 


SEQ ID N0:4 


SEQ ID N0:6 


SEQ ID N0;5 


Fig. 5A-5B 


207223 




TANGO 275 
Human 


SEQ ID N0:7 


SEQ ID N0:9 


SEQ ID N0:8 


Fig. 9A-9D 


207220 


15 


TANGO 245 
Mouse 


SEQ ID NO; 10 


SEQ ID NO: 12 


SEQ ID NO: 11 


Fig. 16A-16G 






TANGO 300 


SEQ ID NO : 13 


SEQ ID NO: 15 


SEQ ID NO: 14 


Fig. 17A-17B 


PTA-293 




Human 














TANGO 300 


SEQ ID NO; 1 6 


SEQ ID NO: 1 8 


SEQ ID NO: 17 


Fig. 19A-19C 




20 


Mouse 














MANGO 245 


SEQ IDN0:19 


SEQ IDNO:21 


SEQ ID NO: 20 


Fig. 23 


207223 




Human 














MANGO 245 


SEQ ID NO:22 


SEQ ID NO;24 


SEQ ID NO;23 


Fig. 25A-25B 


PTA-248 




Monkey 












25 


MANGO 245 
Mouse 


SEQ ID NO:25 


SEQ ID NO:92 


SEQ ID N0:91 


Fig. 29 
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TABLE 2: Summary of Protein Domains of TANGO 244, TANGO 246, TANGO 
275, and TANGO 300 and MANGO 245 



10 



20 



Protein 


dignal reptiae 


Mature 
Protein 


Extracellular 
Domain 


Transmembrane 
Domain 


Cytoplasmic 
Domain 


TANGO 244 


AA 1-26 of 


AA 27-162 of 


AA 27-119 of 


A A 120-142 of 


AA 143-162 of 


Human 


SEQ IDNO:2 


SEQ IDN0:2 


SEQIDNO:2 


SEQ ID NO 2 


SEQ IDN0:2 




SEQ ID NO 26 


SEQ IDNO:27 


SEQIDN0.75 


SEQ ID NO 76 


SEQ ID NO:77 


TANGO 246 






AA 1-305 of 


AA 306-323 of 


AA 324-329 of 


Human 








SEQIDNOS 
SEQ ID NO 57 


SEQ ID NO 5 
SEQ ID NO 58 


SEQ ID N0:5 
SEQIDNO:59 


TANGO 275 


A A 1-29 of 


AA 30-1303 of 


_ 






Human 


SEQ ID NO 8 
SEO ID NO 60 


SEQ ID NO 8 
SEQ ID NO 61 








TANGO 300 


AA 1-20 of 


AA 21-361 of 


AA 2 1-304 of 


AA 305-321 of 


AA 322-361 of 


Human 


SEQ ID NO: 14 


SEQ ID NO 14 


SEQ ID NO 14 


SEQ ID NO 14 


SEQ ID NO: 14 




SEQ ID NO 62 


SEQ ID NO 63 


SEQ ID NO 85 


SEQ ID NO 86 


SEQ IDNO:87 


TANGO 300 


AA 1-19 of 


A A 20-385 of 


AA 20-318 of 


AA 319-335 of 


AA 336-385 of 


Mouse 


SEQ ID NO 17 


SEQ ID NO 17 


SEQ ID NO: 17 


SEQ ID NO 17 


SEQ ID NO: 17 




SEQ ID NO 64 


SEQ ID NO:65 


SEQ ID NO 88 


SEQ ID NO 89 


SEQ ID NO:90 


MANGO 245 


AA 1-16 of 


A A 17-348 of 


AA 17-141 of 


AA 142-158 of 


AA 159-348 of 


Human 


SEQ ID NO 20 


SEQ ID NO 20 


SEQ ID NO:20 


SEQ ID NO 20 


SEQ ID NO:20 




SEQ ID NO 66 


SEQ ID NO:67 


SEQ ID NO: 78 


SEQ ID NO 79 


SEQIDNO:80 


MANGO 245 


AA ]-l6of 


AA 17-329 of 








Monkey 


SEQ ID NO-23 
SEQ ID NO:68 


SEQ ID NO 23 
SEQ ID NO:69 








MANGO 245 


AA 1-16 of 


AA 17-199 of 








Mouse 


SEQ 1DN0:91 


SEQ ID N0:91 









Deposit Information 

Clone Athua62f9, which encodes human TANGO 244, was deposited, as part 
of a composite deposit, as EpT244 with the American Type Culture Collection 
25 (ATCC® 10801 University Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 
and assigned Accession Number 207223. 

Clone Athsa34d2, which encodes human TANGO 246, was deposited, as part 
of a composite deposit, as EpT246 with the American Type Culture Collection 
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(ATCC® 10801 University Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 
and assigned Accession Number 207223. 

Clone Athbbl9dl, which encodes human TANGO 275, was deposited, as part 
of a composite deposit, as EpT275 with the American Type Culture Collection 
5 (ATCC® 10801 University Boulevard, Manassas, VA 201 10-2209) on April 21, 1999 
and assigned Accession Number 207220. 

Clone AthX672i5, which encodes human TANGO 300, was deposited as 
EpT300 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 20110-2236) on June 30, 1999, and assigned Accession 
10 Number PTA-293. 

Clone Alhbabl65e5, which encodes human MANGO 245, was deposited, as 
part of a composite deposit, as EpM245 with the American Type Culture Collection 
(ATCC® 10801 University Boulevard, Manassas, VA 20110-2209) on April 21, 1999 
and assigned Accession Number 207223. 
15 Clone Alkbd75hl , which encodes monkey MANGO 245, was deposited as 

EpK245 with the American Type Culture Collection (ATCC® 10801 University 
Boulevard, Manassas, VA 201 10-2209) on June 18, 1999 and assigned Accession 
Number PTA-248. 

The clones containing cDNA molecules encoding human TANGO 244, human 
20 TANGO 246, and human MANGO 245 were deposited with the American Type 

Culture Collection (ATCC® 10801 University Boulevard, Manassas, VA 201 10-2209) 
on April 21, 1999 as Accession Number 207223, as part of a composite deposit 
representing a mixture of five strains, each carrying one recombinant plasmid 
harboring a particular cDNA clone. 
25 To distinguish the strains and isolate a strain harboring a particular cDNA 

clone, an aliquot of the mixture can be streaked out to single colonies on nutrient 
medium (e.g., LB plates) supplemented with lOOjig/ml ampicillin, single colonies 
grown, and then plasmid DNA extracted using a standard minipreparation procedure. 
Next, a sample of the DNA minipreparation can be digested with a combination of the 
30 restriction enzymes Sal I, Not I, and Sac II and the resultant products resolved on a 
0.8% agarose gel using standard DNA electrophoresis conditions. The digest liberates 
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fragments as follows: human TANGO 244 (1.5 kB), human TANGO 246 (2.0 kB), 
human MANGO 245 (0 .7 kB and 0 .65 kB; human MANGO 245 has a Sac II cut site 
at about bp 693). The identity of the strains can be inferred from the fragments 
Hberated. 

5 Various aspects of the invention are described in further detail in the following 

subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 

10 encode a polypeptide of the invention or a biologically active portion thereof, as well 
as nucleic acid molecules sufficient for use as hybridization probes to identify nucleic 
acid molecules encoding a polypeptide of the invention and fragments of such nucleic 
acid molecules suitable for use as PGR primers for the amplification or mutation of 
nucleic acid molecules. As used herein, the term "nucleic acid molecule" is intended 

15 to include DNA molecules {e,g,, cDNA or genomic DNA) and RNA molecules {e.g., 
mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 

An "isolated" nucleic acid molecule is one which is separated from other 

20 nucleic acid molecules which are present in the natural source of the nucleic acid 
molecule. Preferably, an "isolated" nucleic acid molecule is free of sequences 
(preferably protein encoding sequences) which naturally flank the nucleic acid (i.e., 
sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. In other embodiments, the 

25 "isolated" nucleic acid is free of intron sequences. For example, in various 

embodiments, the isolated nucleic acid molecule can contain less than about 5 kB, 4 
kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of nucleotide sequences which naturally flank 
the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is 
derived. Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, 

30 can be substantially free of other cellular material, or culture medium when produced 
by recombinant techniques, or substantially free of chemical precursors or other 
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chemicals when chemically synthesized. In one embodiment, the nucleic acid 
molecules of the invention comprise a contiguous open reading frame encoding a 
polypeptide of the invention. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
5 having the nucleotide sequence of SEQ ID N0:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 
19, 22, 24, or 25, or a complement thereof, can be isolated using standard molecular 
biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequences of SEQ ID NO: 1, 3, 4, 6. 7, 9, 10, 12, 13, 15, 
16, 18, 19, 22, 24, or 25 as a hybridization probe, nucleic acid molecules of the 

10 invention can be isolated using standard hybridization and cloning techniques {e,g., as 
described in Sambrook et al., eds., Molecular Cloning: A Laboratory Manual, 2nd 
ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989). 

A nucleic acid molecule of the invention can be amphfied using cDNA, 

1 5 mRNA or genomic DNA as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified 
can be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to all or a portion of a nucleic acid 
molecule of the invention can be prepared by standard synthetic techniques, e.g., 

20 using an automated DNA synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence of SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, or a 
portion thereof A nucleic acid molecule which is complementary to a given 

25 nucleotide sequence is one which is sufficiently complementary to the given 

nucleotide sequence that it can hybridize to the given nucleotide sequence thereby 
forming a stable duplex. 

Moreover, a nucleic acid molecule of the invention can comprise only a 
portion of a nucleic acid sequence encoding a full length polypeptide of the invention 

30 for example, a fragment which can be used as a probe or primer or a fragment 
encoding a biologically active portion of a polypeptide of the invention. The 
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nucleotide sequence determined from the cloning one gene allows for the generation 
of probes and primers designed for use in identifying and/or cloning homologues in 
other cell types, e.g., from other tissues, as well as homologues from other mammals. 
The probe/primer typically comprises substantially purified oligonucleotide. The 
5 oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 12, preferably about 25, more preferably 
about 50, 75, 100, 125, 150, 175, 200, 250, 300, 350 or 400 consecutive nucleotides 
of the sense or anti-sense sequence of SEQ ID N0:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 
18,19, 22, 24, or 25, or of a naturally occurring mutant of SEQ ID NO: 1 , 3, 4, 6, 7, 9, 

10 10. 12, 13, 15, 16, 18, 19, 22, 24, or 25. 

Probes based on the sequence of a nucleic acid molecule of the invention can 
be used to detect transcripts or genomic sequences encoding the same protein 
molecule encoded by a selected nucleic acid molecule. The probe comprises a label 
group attached thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an 

15 enzyme co-factor. Such probes can be used as part of a diagnostic test kit for 

identifying cells or tissues which mis-express the protein, such as by measuring levels 
of a nucleic acid molecule encoding the protein in a sample of cells from a subject, 
e.g., detecting mRNA levels or determining whether a gene encoding the protein has 
been mutated or deleted, 

20 A nucleic acid fragment encoding a biologically active portion of a 

polypeptide of the invention can be prepared by isolating a portion of any of SEQ ID 
N0;1, 3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22,24, or 25, expressing the encoded 
portion of the polypeptide protein (e.g., by recombinant expression in vitro) and 
assessing the activity of the encoded portion of the polypeptide. 

25 The invention further encompasses nucleic acid molecules that differ from the 

nucleotide sequence of SEQ ID N0:1, 3, 4, 6, 7,9, 10, 12, 13, 15, 16, 18, 19, 22, 24, 
or 25, due to degeneracy of the genetic code and thus encode the same protein as that 
encoded by the nucleotide sequence of SEQ ID N0:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 
18, 19, 22, 24, or 25. 

30 In addition to the nucleotide sequences of SEQ ID NO: 1,3,4, 6, 7, 9, 10, 12, 

13, 15, 16, 18, 19, 22, 24, or 25, it will be appreciated by those skilled in the art that 
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DNA sequence polymorphisms that lead to changes in the amino acid sequence may 
exist within a population {e.g., the human population). Such genetic polymorphisms 
may exist among individuals within a population due to natural allelic variation. An 
allele is one of a group of genes which occur alternatively at a given genetic locus. 
5 As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. As 
used herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide of the invention. Such 
natural allelic variations can typically result in 1-5% variance in the nucleotide 

10 sequence of a given gene. Alternative alleles can be identified by sequencing the gene 
of interest in a number of different individuals. This can be readily carried out by 
using hybridization probes to identify the same genetic locus in a variety of 
individuals. Any and all such nucleotide variations and resulting amino acid 
polymorphisms or variations that are the result of natural allelic variation and that do 

1 5 not alter the functional activity are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding proteins of the invention from 
other species (homologues), which have a nucleotide sequence which differs from that 
of the human protein described herein are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 

20 homologues of a cDNA of the invention can be isolated based on their identity to the 
human nucleic acid molecule disclosed herein using the human cDNAs, or a portion 
thereof, as a hybridization probe according to standard hybridization techniques under 
stringent hybridization conditions. For example, a cDNA encoding a soluble form of 
a membrane-bound protein of the invention isolated based on its hybridization to a 

25 nucleic acid molecule encoding all or part of the membrane-bound form. Likewise, a 
cDNA encoding a membrane-bound form can be isolated based on its hybridization to 
a nucleic acid molecule encoding all or part of the soluble form. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 

30 900, 1000, or 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3200, 
3400, 3600, 3800, 4000, or 4200) nucleotides in length and hybridizes under stringent 
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conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID N0:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 
19, 22, 24, or 25, or a complement thereof 

As used herein, the term "hybridizes under stringent conditions" is intended to 
5 describe conditions for hybridization and washing under which nucleotide sequences 
at least 60% (65%, 70%, preferably 75% or more) identical to each other typically 
remain hybridized to each other. Such stringent conditions are known to those skilled 
in the art and can be found in Current Protocols in Molecular Biology y John Wiley & 
Sons, N.Y. (1989), 6.3.1-6.3.6, which describes aqueous and non-aqueous methods, 

10 either of which can be used. Another preferred, non-limiting example of stringent 
hybridization conditions are hybridization in 6X sodium chloride/sodium citrate 
(SSC) at about 45°C, followed by one or more washes in 2.0 X SSC at 50° C. (low 
stringency) or 0.2 X SSC, 0.1% SDS at 50-65°C (high stringency). Another preferred 
example of stringent hybridization conditions are hybridization in 6X sodium 

15 chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X 
SSC, 0.1% SDS at 50°C. Another example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by 
one or more washes in 0.2X SSC, 0. 1 % SDS at 55°C. A further example of stringent 
hybridization conditions are hybridization in 6X sodium chloride/sodium citrate 

20 (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 
60°C. Preferably, stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X 
SSC, 0.1% SDS at 65°C. Particularly preferred stringency conditions (and the 
conditions that should be used if the practitioner is uncertain about what conditions 

25 should be applied to determine if a molecule is within a hybridization limitation of the 
invention) are 0.5M Sodium Phosphate, 7% SDS at 65°C, followed by one or more 
washes at 0.2X SSC, 1% SDS at 65°C. In one embodiment, an isolated nucleic acid 
molecule of the invention that hybridizes under stringent conditions to the sequence of 
SEQ ID N0:1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, or a complement 

30 thereof, corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 
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"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule 
having a nucleotide sequence that occurs in nature {e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of 
the invention sequence that may exist in the population, the skilled artisan will further 
5 appreciate that changes can be introduced by mutation thereby leading to changes in 
the amino acid sequence of the encoded protein, without altering the biological 
activity of the protein. For example, one can make nucleotide substitutions leading to 
amino acid substitutions at "non-essential" amino acid residues. A "non-essential" 
amino acid residue is a residue that can be altered from the wild-type sequence 

10 without altering the biological activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologues of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, 
amino acid residues that are conserved among the homologues of various species 

15 {e.g., murine and human) may be essential for activity and thus would not be likely 
targets for alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues 
that are not essential for activity. Such polypeptides differ in amino acid sequence 

20 from SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule includes a nucleotide sequence 
encoding a protein that includes an amino acid sequence that is at least about 45% 
identical, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ 
IDN0s:2, 5,8, 11, 14, 17, 20, 23, or 91. 

25 An isolated nucleic acid molecule encoding a variant protein can be created by 

introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ IDNO:l,3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22,24, 
or 25, such that one or more amino acid substitutions, additions or deletions are 
introduced into the encoded protein. Mutations can be introduced by standard 

30 techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Briefly, PCR primers are designed that delete the trinucleotide codon of the amino 
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acid to be changed and replace it with the trinucleotide codon of the amino acid to be 
included. This primer is used in the PGR amplification of DNA encoding the protein 
of interest. This fragment is then isolated and inserted into the full length cDNA 
encoding the protein of interest and expressed recombinantly. The resulting protein 
5 now includes the amino acid replacement. 

Preferably, conservative amino acid substitutions are made at one or more 
predicted non-essential amino acid residues. Conservative replacements are those that 
take place within a family of amino acids that are related in their side chains. 
Genetically encoded amino acids are can be divided into four families: (1) acidic = 

10 aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) nonpolar alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) 
uncharged polar === glycine, asparagine, glutamine, cysteine, serine, threonine, 
tyrosine. In similar fashion, the amino acid repertoire can be grouped as (1) acidic = 
aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = glycine, 

15 alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine 

optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic = phenylalanine, 
tyrosine, tryptophan; (5) amide = asparagine, glutamine; and (6) sulfur -containing = 
cysteine and methionine. (See, for example, Biochemistry, 4th ed., Ed. by L. Stryer, 
WH Freeman and Co.: 1995). 

20 Alternatively, mutations can be introduced randomly along all or part of the 

coding sequence, such as by saturation mutagenesis, and the resultant mutants can be 
screened for biological activity to identify mutants that retain activity. Following 
mutagenesis, the encoded protein can be expressed recombinantly and the activity of 
the protein can be determined. 

25 In a preferred embodiment, a mutant polypeptide that is a variant of a 

polypeptide of the invention can be assayed for: (1) the ability to form protein-protein 
interactions with proteins in a signaling pathway of the polypeptide of the invention; 
(2) the ability to bind a ligand of the polypeptide of the invention; or (3) the ability to 
bind to an intracellular target protein of the polypeptide of the invention. In yet 

30 another preferred embodiment, the mutant polypeptide can be assayed for the ability 
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to modulate cellular proliferation, cellular migration or chemotaxis, or cellular 
differentiation. 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid encoding a polypeptide of 
5 the invention, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid 
can be complementary to an entire coding strand, or to only a portion thereof, e.g., all 
or part of the protein coding region (or open reading frame). An antisense nucleic 

10 acid molecule can be antisense to all or part of a non-coding region of the coding 
strand of a nucleotide sequence encoding a polypeptide of the invention. The non- 
coding regions ("5' and 3' untranslated regions") are the 5' and 3' sequences which 
flank the coding region and are not translated into amino acids. 

An antisense ohgonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 

15 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (eg., an 
antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological 

20 stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. Examples of modified nucleotides 
which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 

25 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, p-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methyhnosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, P- 

30 D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2- 
methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
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pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid 
(v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
5 using an expression vector into which a nucleic acid has been subcloned in an 

antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of 
an antisense orientation to a target nucleic acid of interest, described further in the 
following subsection). 

The antisense nucleic acid molecules of the invention are typically 

10 administered to a subject or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a selected polypeptide of the 
invention to thereby inhibit expression, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid 

15 molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. 
Alternatively, antisense nucleic acid molecules can be modified to target selected cells 
and then administered systemically. For example, for systemic administration, 

20 antisense molecules can be modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by linking the antisense nucleic 
acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 

25 antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual P-units, 

30 the strands run parallel to each other (Gaultier et al., 1987, Nucleic Acids Res, 
15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
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methylribonucleotide (Inoue et al., 1987, Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded 
5 nucleic acid, such as an nxRNA, to which they have a complementary region. Thus, 
ribozymes {e.g., hammerhead ribozymes; described in Haselhoff and Gerlach, 1988, 
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having 
specificity for a nucleic acid molecule encoding a polypeptide of the invention can be 

10 designed based upon the nucleotide sequence of a cDNA disclosed herein. For 
example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which 
the nucleotide sequence of the active site is complementary to the nucleotide sequence 
to be cleaved in a Cech et al. U.S. Patent No. 4,987,071 ; and Cech et al. U.S. Patent 
No. 5,1 16,742. Alternatively, an mRNA encoding a polypeptide of the invention can 

15 be used to select a catalytic RNA having a specific ribonuclease activity from a pool 
of RNA molecules. See, e^., Bartel and Szostak, 1993, 261:141 1-1418, 

The invention also encompasses nucleic acid molecules which form triple 
helical structures. For example, expression of a polypeptide of the invention can be 
inhibited by targeting nucleotide sequences complementary to the regulatory region of 

20 the gene encoding the polypeptide {e.g., the promoter and/or enhancer) to form triple 
helical structures that prevent transcription of the gene in target cells. See generally 
Helene, \99\, Anticancer Drug Des, 6(6):569-84; Helene, 1992, Ann. NY. Acad. Sci. 
660:27-36; andMaher, 1992, Bioassays 14(12):807-15. 

In various embodiments, the nucleic acid molecules of the invention can be 

25 modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 
acids {see Hyrup et al., 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As 
used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, 

30 e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The 
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neutral backbone of PNAs has been shown to allow for specific hybridization to DNA 
and RNA under conditions of low ionic strength. The synthesis of PNA oligomers 
can be performed using standard solid phase peptide synthesis protocols as described 
in Hyrup et al., 1996, supra\ Perry-O'Keefe et al., 1996, Proc. Natl Acad. ScL USA 
5 93: 14670-675. 

PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
rephcation. PNAs can also be used, e.g., in the analysis of single base pair mutations 

10 in a gene by, e.g.y PNA directed PGR clamping; as artificial restriction enzymes when 
used in combination with other enzymes, e.g., SI nucleases (Hyrup, 1996, supra; or 
as probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry- 
O'Keefe et al., 1996, Proc. Natl, Acad. Sci, USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., to enhance their stability 

1 5 or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of 
drug delivery known in the art. For example, PNA-DNA chimeras can be generated 
which may combine the advantageous properties of PNA and DNA. Such chimeras 
allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to interact 

20 with the DNA portion while the PNA portion would provide high binding affinity and 
specificity, PNA-DNA chimeras can be linked using linkers of appropriate lengths 
selected in terms of base stacking, number of bonds between the nucleobases, and 
orientation (Hyrup, 1996, supra). The synthesis of PNA-DNA chimeras can be 
performed as described in Hyrup, 1996, supra, and Finn et al., 1996, Nucleic Acids 

25 Res. 24(17):3357-63. For example, a DNA chain can be synthesized on a solid 

support using standard phosphoramidite coupling chemistry and modified nucleoside 
analogs. Compounds such as 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine 
phosphoramidite can be used as a link between the PNA and the 5' end of DNA (Mag 
et al., 1989, Nucleic Acids Res. 17:5973-88). PNA monomers are then coupled in a 

30 stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
DNA segment (Finn et al., 1996, Nucleic Acids Res. 24(17):3357-63). Altematively, 
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chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al., 1975, Bioorganic Med. Chem. Lett. 5:1119-11 124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. 
Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad Sci, USA 84:648- 
652; PCX Publication No. WO 88/09810) or the blood-brain barrier {see, e.g, PCT 
Publication No. WO 89/101 34). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, Bio/Techniques 
10 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

15 II. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins, and biologically 
active portions thereof, as well as polypeptide fragments suitable for use as 
immunogens to raise antibodies directed against a polypeptide of the invention. In 
one embodiment, the native polypeptide can be isolated from cells or tissue sources 

20 by an appropriate purification scheme using standard protein purification techniques. 
In another embodiment, polypeptides of the invention are produced by recombinant 
DNA techniques. Alternative to recombinant expression, a polypeptide of the 
invention can be synthesized chemically using standard peptide synthesis techniques. 
An "isolated" or "purified" protein or biologically active portion thereof is 

25 substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 
preciu*sors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of protein in which the 
protein is separated from cellular components of the cells from which it is isolated or 

30 recombinantly produced. Thus, protein that is substantially free of cellular material 
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by 
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dry weight) of heterologous protein (also referred to herein as a "contaminating 
protein"). When the protein or biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of culture medium, i.e., culture 
medium represents less than about 20%, 10%, or 5% of the volume of the protein 
5 preparation. When the protein is produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other chemicals, i.e., it is separated from 
chemical precursors or other chemicals which are involved in the synthesis of the 
protein. Accordingly such preparations of the protein have less than about 30%, 20%, 
10%>, 5% (by dry weight) of chemical precursors or compounds other than the 

10 polypeptide of interest. The term "pure" or "isolated*' as used herein preferably has 
the same numerical limits as "purified" or ''isolated*' immediately above. "Isolated" 
and "purified" do not encompass either natural materials in their native state or natural 
materials that have been separated into components {e.g., in an acrylamide gel) but 
not obtained either as pure (e.g., lacking contaminating proteins, or chromatography 

15 reagents such as denaturing agents and polymers, e.g., acrylamide or agarose) 

substances or solutions. In preferred embodiments, purified or isolated preparations 
will lack any contaminating proteins from the same animal from which the protein is 
normally produced, as can be accomplished by recombinant expression of, for 
example, a human protein in a non-human cell. 

20 Biologically active portions of a polypeptide of the invention include 

polypeptides comprising amino acid sequences sufficiently identical to or derived 
from the amino acid sequence of the protein (e.g., the amino acid sequence shown in 
any of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91), which include fewer amino 
acids than the full length protein, and exhibit at least one activity of the 

25 corresponding full-length protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 
biologically active portions, in which other regions of the protein are deleted, can be 

30 prepared by recombinant techniques and evaluated for one or more of the functional 
activities of the native form of a polypeptide of the invention. 
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Preferred polypeptides have the amino acid sequence of SEQ ED N0s:2, 5, 8, 
1 1, 14, 17, 20, 23, or 91 . Other useful proteins are substantially identical (e.g., at least 
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99%) to any of SEQ ID NOs: 
2, 5, 8, 1 1, 14, 17, 20, 23, or 91 and retain the functional activity of the protein of the 
5 corresponding naturally-occurring protein yet differ in amino acid sequence due to 
natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes {e.g., gaps 
can be introduced in the sequence of a first amino acid or nucleic acid sequence for 
10 optimal alignment with a second amino or nucleic acid sequence). The amino acid 
residues or nucleotides at corresponding amino acid positions or nucleotide positions 
are then compared. When a position in the first sequence is occupied by the same 
amino acid residue or nucleotide as the corresponding position in the second 
sequence, then the molecules are identical at that position. The percent identity 
1 5 between the two sequences is a function of the number of identical positions shared 
by the sequences (i.e., % identity = # of identical positions/total # of positions (e.g., 
overlapping positions) x 100). In one embodiment the two sequences are the same 
length. 

The determination of percent identity between two sequences can be 
20 accomplished using a mathematical algorithm. A preferred, non-limiting example of 
a mathematical algorithm utilized for the comparison of two sequences is the 
algorithm of Karlin and Altschul, 1990, Proc, Natl. Acad. ScL USA 87:2264-2268, 
modified as in Karlin and Altschul, 1993, Proc. Natl. Acad. Sci. USA 90:5873-5877, 
Such an algorithm is incorporated into the NBLAST and XBLAST programs of 
25 Altschul, et al, 1990, 1 MoL Biol 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to a nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
wordlength = 3 to obtain amino acid sequences homologous to a protein molecules of 
30 the invention. To obtain gapped alignments for comparison purposes, Gapped 
BLAST can be utilized as described in Altschul et al., 1997, Nucleic Acids Res. 
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25:3389-3402. Alternatively, PSI-Blast can be used to perform an iterated search 
which detects distant relationships between molecules {Id.), When utilizing BLAST, 
Gapped BLAST, and PSI-Blast programs, the default parameters of the respective 
programs {e.g., XBLAST and NBLAST) can be used. See 
5 http://www.ncbi.nlm.mh.gov. Another preferred, non-limiting example of a 

mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, 1988, CABIOS A'AlAl . Such an algorithm is incorporated into the 
ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
10 PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can 
be used. 

The percent identity between two sequences can be determined using 
techniques similar to those described above, with or without allowing gaps. In 
calculating percent identity, typically exact matches are counted. 

15 The invention also provides chimeric or fusion proteins. As used herein, a 

"chimeric protein" or "fusion protein" comprises all or part (preferably biologically 
active) of a polypeptide of the invention operably hnked to a heterologous 
polypeptide (i.e., a polypeptide other than the same polypeptide of the invention). 
Within the fusion protein, the term "operably linked" is intended to indicate that the 

20 polypeptide of the invention and the heterologous polypeptide are fused in-frame to 
each other. The heterologous polypeptide can be fused to the N-terminus or C- 
terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in which the polypeptide of 
the invention is fused to the C-terminus of GST sequences. Such fusion proteins can 

25 facilitate the purification of a recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 
peptide at its N-terminus. For example, the native signal peptide of a polypeptide of 
the invention can be removed and replaced with a signal peptide from another protein. 
For example, the gp67 secretory sequence of the baculovirus envelope protein can be 

30 used as a heterologous signal peptide {Current Protocols in Molecular Biology, 
Ausubel et al., eds., John Wiley & Sons, 1992). Other examples of eukaryotic 

60 



wo 01/00672 



PCT/USOO/18184 



heterologous signal peptides include the secretory sequences of melittin and human 
placental alkaline phosphatase (Stratagene; La Jolla, California). In yet another 
example, useful prokaryotic heterologous signal peptides include the phoA secretory 
signal (Sambrook et al., supra) and the protein A secretory signal (Pharmacia Biotech; 
5 Piscataway, New Jersey). 

In yet another embodiment, the flision protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide of the invention is fused to sequences 
derived from a member of the immunoglobulin protein family. The immunoglobulin 
fusion proteins of the invention can be incorporated into pharmaceutical compositions 

10 and administered to a subject to inhibit an interaction between a ligand (soluble or 
membrane-bound) and a protein on the surface of a cell (receptor), to thereby suppress 
signal transduction in vivo. The immunoglobulin fusion protein can be used to affect 
the bioavailability of a cognate hgand of a polypeptide of the invention. Inhibition of 
ligand/receptor interaction may be useful therapeutically, both for treating 

1 5 proliferative and differentiative disorders and for modulating {e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce antibodies directed against a 
polypeptide of the invention in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of receptors with ligands. 

20 Chimeric and fusion proteins of the invention can be produced by standard 

recombinant DNA techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 

25 fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence {see, e.g., Ausubel et al.. supra). Moreover, many expression vectors 
are commercially available that already encode a fusion moiety {e.g., a GST 
polypeptide). A nucleic acid encoding a polypeptide of the invention can be cloned 
into such an expression vector such that the fusion moiety is linked in-frame to the 

30 polypeptide of the invention. 
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A signal peptide of a polypeptide of the invention (SEQ ID NOs:26, 60, 62, 
64, 66, or 68) can be used to facilitate secretion and isolation of the secreted protein or 
other proteins of interest. Signal peptides are typically characterized by a core of 
hydrophobic amino acids which are generally cleaved from the mature protein during 
5 secretion in one or more cleavage events. Such signal peptides contain processing 
sites that allow cleavage of the signal peptide from the mature proteins as they pass 
through the secretory pathway. Thus, the invention pertains to the described 
polypeptides having a signal peptide, as well as to the signal peptide itself and to the 
polypeptide in the absence of the signal peptide (i.e., the cleavage products). In one 

10 embodiment, a nucleic acid sequence encoding a signal peptide of the invention can 
be operably linked in an expression vector to a protein of interest, such as a protein 
which is ordinarily not secreted or is otherwise difficult to isolate. The signal peptide 
directs secretion of the protein, such as from a eukaryotic host into which the 
expression vector is transformed, and the signal peptide is subsequently or 

1 5 concurrently cleaved. The protein can then be readily purified from the extracellular 
medium by art recognized methods. Alternatively, the signal peptide can be linked to 
the protein of interest using a sequence which facilitates purification, such as with a 
GST domain. 

In another embodiment, the signal peptides of the present invention can be 
20 used to identify regulatory sequences, e.g., promoters, enhancers, repressors. Since 
signal peptides are the most amino-terminal sequences of a peptide, it is expected that 
the nucleic acids which flank the signal peptide on its amino-terminal side will be 
regulatory sequences which affect transcription. Thus, a nucleotide sequence which 
encodes all or a portion of a signal peptide can be used as a probe to identify and 
25 isolate signal peptides and their flanking regions, and these flanking regions can be 
studied to identify regulatory elements therein. 

The present invention also pertains to variants of the polypeptides of the 
invention. Such variants have an altered amino acid sequence which can function as 
either agonists (mimetics) or as antagonists. Variants can be generated by 
30 mutagenesis, eg,, discrete point mutation or truncation. An agonist can retain 
substantially the same, or a subset, of the biological activities of the naturally 
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occurring form of the protein. An antagonist of a protein can inhibit one or more of 
the activities of the naturally occurring form of the protein by, for example, 
competitively binding to a downstream or upstream member of a cellular signaling 
cascade which includes the protein of interest. Thus, specific biological effects can be 
5 elicited by treatment with a variant of limited function. Treatment of a subject with a 
variant having a subset of the biological activities of the naturally occurring form of 
the protein can have fewer side effects in a subject relative to treatment with the 
naturally occurring form of the protein. 

Modification of the structure of the subject polypeptides can be for such 

10 purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo 
shelf life and resistance to proteolytic degradation in v/vo), or post-translational 
modifications {e.g., to alter phosphorylation pattern of protein). Such modified 
peptides, when designed to retain at least one activity of the naturally-occurring form 
of the protein, or to produce specific antagonists thereof, are considered fiinctional 

1 5 equivalents of the polypeptides described in more detail herein. Such modified 
peptides can be produced, for instance, by amino acid substitution, deletion, or 
addition. 

For example, it is reasonable to expect that an isolated replacement of a 
leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a 
20 serine, or a similar replacement of an amino acid with a structurally related amino 
acid (i.e. isosteric and/or isoelectric mutations) will not have a major effect on the 
biological activity of the resulting molecule. 

Whether a change in the amino acid sequence of a peptide results in a 
functional homolog (e.g., functional in the sense that the resulting polypeptide mimics 
25 or antagonizes the wild-type form) can be readily determined by assessing the abihty 
of the variant peptide to produce a response in cells in a fashion similar to the wild- 
type protein, or competitively inhibit such a response. Polypeptides in which more 
than one replacement has taken place can readily be tested in the same manner. 
Variants of a protein of the invention which function as either agonists 
30 (mimetics) or as antagonists can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of the protein of the invention for agonist or 
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antagonist activity. In one embodiment, a variegated library of variants is generated 
by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated 
gene library. A variegated library of variants can be produced by, for example, 
enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences 
5 such that a degenerate set of potential protein sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion proteins {e.g., for phage 
display). There are a variety of methods which can be used to produce libraries of 
potential variants of the polypeptides of the invention from a degenerate 
oligonucleotide sequence. Methods for synthesizing degenerate oligonucleotides are 

10 known in the art (see, e.g., Narang, 1983, Tetrahedron 39:3; Itakura et al., 1984, 
Annu. Rev. Biochem. 53:323; Itakura et al, 1984, Science 198:1056; Ike et al., 1983, 
Nucleic Acid ResAVAll). 

In addition, libraries of fragments of the coding sequence of a polypeptide of 
the invention can be used to generate a variegated population of polypeptides for 

15 screening and subsequent selection of variants. For example, a library of coding 
sequence fragments can be generated by treating a double stranded PCR fragment of 
the coding sequence of interest with a nuclease under conditions wherein nicking 
occurs only about once per molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include sense/anti sense pairs from 

20 different nicked products, removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting fragment library into an 
expression vector. By this method, an expression library can be derived which 
encodes N-terminal and internal fragments of various sizes of the protein of interest. 
Several techniques are known in the art for screening gene products of 

25 combinatorial libraries made by point mutations or truncation, and for screening 
cDNA libraries for gene products having a selected property. The most widely used 
techniques, which are amenable to high through-put analysis, for screening large gene 
Ubraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting Ubrary of vectors, and expressing the 

30 combinatorial genes under conditions in which detection of a desired activity 
facilitates isolation of the vector encoding the gene whose product was detected. 
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Recursive ensemble mutagenesis (REM), a technique which enhances the frequency 
of functional mutants in the libraries, can be used in combination with the screening 
assays to identify variants of a protein of the invention (Arkin and Yourvan, 1992, 
Proc. Natl. Acad, ScL USA 59:7811-7815; Delgrave et al., 1993, Protein Engineering 
5 6(3):327-331). 

An isolated polypeptide of the invention, or a fragment thereof can be used as 
an immunogen to generate antibodies using standard techniques for polyclonal and 
monoclonal antibody preparation. The full-length polypeptide or protein can be used 
or, alternatively, the invention provides antigenic peptide fragments for use as 
10 immunogens. The antigenic peptide of a protein of the invention comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the amino acid sequence of SEQ 
ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, and encompasses an epitope of the protein 
such that an antibody raised against the peptide forms a specific immune complex 
with the protein. 

15 Preferred epitopes encompassed by the antigenic peptide are regions that are 

located on the surface of the protein, e.g., hydrophihc regions. Hydropathy plots or 
similar analyses can be used to identify hydrophilic regions. 

An isolated polypeptide of the invention, or a fragment thereof can be used as 
an immunogen to generate antibodies using standard techniques for polyclonal and 

20 monoclonal antibody preparation. The full-length polypeptide or protein can be used 
or, alternatively, the invention provides antigenic peptide fragments for use as 
immunogens. The antigenic peptide of a protein of the invention comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the amino acid sequence of SEQ 
ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, and encompasses an epitope of the protein 

25 such that an antibody raised against the peptide forms a specific immune complex 
with the protein. 

An immunogen typically is used to prepare antibodies by immunizing a 
suitable subject, {e.g., rabbit, goat, mouse or other mammal). An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed or 
30 chemically synthesized polypeptide. The preparation can further include an adjuvant, 
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such as Freund's complete or incomplete adjuvant, or similar immunostimulatory 
agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The term "antibody" as used herein refers to 
5 immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen binding site which specifically binds 
an antigen, such as a polypeptide of the invention, eg., an epitope of a polypeptide of 
the invention. A molecule which specifically binds to a given polypeptide of the 
invention is a molecule which binds the polypeptide, but does not substantially bind 

10 other molecules in a sample, e.g., a biological sample, which naturally contains the 
polypeptide. Examples of immunologically active portions of immunoglobulin 
molecules include F(ab) and F{ab')2 fragments which can be generated by treating the 
antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies. The term "monoclonal antibody" or "monoclonal antibody 

15 composition", as used herein, refers to a population of antibody molecules that 

contain only one species of an antigen binding site capable of immunoreacting with a 
particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 

20 polyclonal antibody compositions are ones that have been selected for antibodies 
directed against a polypeptide or polypeptides of the invention. Particularly preferred 
polyclonal antibody preparations are ones that contain only antibodies directed against 
a polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 

25 immunogen compositions made using a non-human host cell for recombinant 
expression of a polypeptide of the invention. In such a manner, the only human 
epitope or epitopes recognized by the resulting antibody compositions raised against 
this immunogen will be present as part of a polypeptide or polypeptides of the 
invention. 

30 The antibody titer in the immunized subject can be monitored over time by 

standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
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using immobilized polypeptide. If desired, the antibody molecules can be isolated 
from the mammal (eg., from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain the IgG fraction. 
Alternatively, antibodies specific for a protein or polypeptide of the invention can be 
5 selected for (e.g., partially purified) or purified by, e.g., affinity chromatography. For 
example, a recombinantly expressed and purified (or partially purified) protein of the 
invention is produced as described herein, and covalently or non-covalently coupled 
to a solid support such as, for example, a chromatography column. The column can 
then be used to affinity purify antibodies specific for the proteins of the invention 

10 from a sample containing anfibodies directed against a large number of different 
epitopes, thereby generating a substantially purified antibody composition, i.e., one 
that is substantially free of contaminafing antibodies. By a substanfially purified 
antibody composition is meant, in this context, that the antibody sample contains at 
most only 30% (by dry weight) of contaminating antibodies directed against epitopes 

15 other than those on the desired protein or polypeptide of the invention, and preferably 
at most 20%, yet more preferably at most 10%, and most preferably at most 5% (by 
dry weight) of the sample is contaminating antibodies. A purified antibody 
composition means that at least 99% of the antibodies in the composition are directed 
against the desired protein or polypeptide of the invention. 

20 At an appropriate time after immunization, e.g., when the specific antibody 

titers are highest, antibody-producing cells can be obtained from the subject and used 
to prepare monoclonal antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein, 1975, Nature 256:495-497, the 
human B cell hybridoma technique (Kozbor et al., 1983, Immunol. Today 4:72), the 

25 EBV-hybridoma technique (Cole et al., 1985, Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pgs. 77-96) or trioma techniques. The technology for 
producing hybridomas is well known (see generally Current Protocols in 
Immunology, 1994, Coligan et al. (eds.) John Wiley & Sons, Inc., New York, NY). 
Hybridoma cells producing a monoclonal antibody of the invention are detected by 

30 screening the hybridoma culture supematants for antibodies that bind the polypeptide 
of interest, e.g., using a standard ELISA assay. 
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Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody directed against a polypeptide of the invention can be identified 
and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., 
an antibody phage display library) with the polypeptide of interest. Kits for 
5 generating and screening phage display libraries are commercially available (e.g., the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the 
Stratagene SurfZAPJ Phage Display Kit, Catalog No. 2406 12). Additionally, 
examples of methods and reagents particularly amenable for use in generating and 
screening antibody display library can be found in, for example, U.S. Patent No. 

10 5,223,409; PCT Publication No. WO 92/18619; PCT Publication No. WO 91/17271; 
PCT Publication No. WO 92/20791; PCT Pubhcation No. WO 92/15679; PCT 
Publication No. WO 93/01288; PCT Pubhcation No. WO 92/01047; PCT Publication 
No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al., 1991, 
Bio/Technology 9:1370-1372; Hay et al., 1992, Hum. Antibod. Hybridomas 3:81-85; 

15 Huseetal., 1989, &/e//ce 246:1275-1281; Griffiths et al., 1993, EMBO J. 12:725- 
734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 

20 invention. A chimeric antibody is a molecule in which different portions are derived 
fi-om different animal species, such as those having a variable region derived from a 
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al., 
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are 
incorporated herein by reference in their entirety). Such chimeric and humanized 

25 monoclonal antibodies can be produced by recombinant DNA techniques known in 
the art, for example using methods described in PCT Publication No. WO 87/02671 ; 
European Patent Application 184,187; European Patent Application 171,496; 
European Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. 
Patent No. 4,816,567; European Patent Application 125,023; Better et al., 1988, 

30 Science 240:1041-1043; Liu et al., 1987, Proc. Nati Acad. Sci. USA 84:3439-3443; 
Liu et al., 1987, J. Immunol. 139:3521-3526; Sun et al., 1987, Proc. Nati Acad. Sci. 
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USA 84:214-218; Nishimura et al., 1987, Cane, Res. 47:999-1005; Wood et al., 1985, 
Nature 314:446-449; and Shaw et al., 1988,/ Natl. Cancer Inst, 80:1553-1559); 
Momson, 1985, Science 229:1202-1207; Oi et al., 1986, Bio/Techniques 4:214; U.S. 
Patent 5,225,539; Jones et al, 1986, Nature 321:552-525; Verhoeyan et al., 1988, 
5 5c/e«ce 239: 1534; and Beidleretal., mS, J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly desirable for therapeutic 
treatment of human patients. Such antibodies can be produced, for example, using 
transgenic mice which are incapable of expressing endogenous immunoglobulin 
heavy and light chains genes, but which can express human heavy and light chain 

10 genes. The transgenic mice are inmiunized in the normal fashion with a selected 

antigen, e^., all or a portion of a polypeptide of the invention. Monoclonal antibodies 
directed against the antigen can be obtained using conventional hybridoma 
technology. The human immunoglobulin transgenes harbored by the transgenic mice 
rearrange during B cell differentiation, and subsequently undergo class switching and 

1 5 somatic mutation. Thus, using such a technique, it is possible to produce 
therapeutically useful IgG, IgA and IgE antibodies. For an overview of this 
technology for producing human antibodies, see Lonberg and Huszar (1995, Int. Rev. 
Immunol. 13:65-93). For a detailed discussion of this technology for producing 
human antibodies and human monoclonal antibodies and protocols for producing such 

20 antibodies, see, e.g., U.S. Patent 5,625,126; U.S. Patent 5,633,425; U.S. Patent 

5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In addition, companies 
such as Abgenix, Inc. (Freemont, CA), can be engaged to provide human antibodies 
directed against a selected antigen using technology similar to that described above. 
Completely human antibodies which recognize a selected epitope can be 

25 generated using a technique referred to as "guided selection." In this approach a 
selected non-human monoclonal antibody, e.g.^ a murine antibody, is used to guide 
the selection of a completely human antibody recognizing the same epitope. (Jespers 
et al., \99A, Bio/technology 12:899-903). 

An antibody directed against a polypeptide of the invention {e.g., monoclonal 

30 antibody) can be used to isolate the polypeptide by standard techniques, such as 

affinity chromatography or immunoprecipitation. Moreover, such an antibody can be 
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used to detect the protein {e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the polypeptide. The antibodies 
can also be used diagnostically to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the efficacy of a given treatment 
5 regimen. Detection can be facilitated by coupling the antibody to a detectable 
substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and 
radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, 8-galactosidase, or acetylcholinesterase; examples of suitable 

10 prosthetic group complexes include streptavidin/biotin and avidin^iotin; examples of 
suitable fluorescent materials include umbeUiferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 

15 suitable radioactive material include ^^^I, ^^'l, or ^H. 

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A 
cytotoxin or cytotoxic agent includes any agent that is detrimental to cells. Examples 
include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, 

20 etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, 
dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1- 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and 
puromycin and analogs or homologs thereof. Therapeutic agents include, but are not 
limited to, antimetabolites {e.g. , methotrexate, 6-mercaptopurine, 6-thioguanine, 

25 cytarabine, 5-fluorouracil decarbazine), alkylating agents {e.g., mechlorethamine, 
thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), 
cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis- 
dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines {e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 

30 actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic 
agents {e.g., vincristine and vinblastine). 
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The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a 
5 toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such 
as tumor necrosis factor, a-interferon, P-interferon, nerve growth factor, platelet 
derived growth factor, tissue plasminogen activator; or, biological response modifiers 
such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), 
interleukin-6 ("IL-6"), granulocyte macrophage colony stimulating factor ("GM- 

10 CSF"), granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 

Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see, e.g., Amon et al., "Monoclonal Antibodies For Immunotargeting Of 
Drugs In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld 
et al. (eds.), pgs. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For 

15 Drug Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pgs. 
623-53 (Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents 
In Cancer Therapy: A Review", in Monoclonal Antibodies '84: Biological And 
Chnical Applications, Pinchera et al. (eds.), pgs. 475-506 (1985); "Analysis, Results, 
And Future Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer 

20 Therapy", in Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et 
al (eds.), pgs. 303-16 (Academic Press 1985), and Thorpe et al., 1982, "The 
Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates", Immunol 
Rev., 62:119-58. 

Alternatively, an antibody can be conjugated to a second antibody to form an 
25 antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

Accordingly, in one aspect, the invention provides substantially purified 
antibodies or fragment thereof, and human and non-human antibodies or fragments 
thereof, which antibodies or fragments specifically bind to a polypeptide comprising 
an amino acid sequence selected from the group consisting of: the amino acid 
30 sequenceof any oneof SEQ IDNOs:2, 5, 8, 11, 14, 17, 20, 23, or 91, or an amino 
acid sequence encoded by the cDNA of a clone deposited as ATCC® Accession 

71 



wo 01/00672 



PCT/US(»0/181S4 



Number 207220, 207223, PTA-248 or PTA-293; a fragment of at least 15 amino acid 
residues of the amino acid sequence of any one of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 
23, or 91, an amino acid sequence which is at least 95% identical to the amino acid 
sequence of any one of SEQ ED NOs:2, 5, 8, 11, 14, 17, 20, 23, or 91, wherein the 
5 percent identity is determined using the ALIGN program of the GCG software 
package with a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 or the BESTFIT program with BLOSUM 62 scoring matrix, gap open 
penalty of 12, frame shift penalty of 5, gap extend penalty of 4; and an amino acid 
sequence which is encoded by a nucleic acid molecule which hybridizes to the nucleic 

10 acid molecule consisting of any one of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 
18, 19, 22, 24, or 25, or the cDNA of a clone deposited as ATCC® Accession Number 
207220, 207223, PTA-248 or PTA-293, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0. 1% SDS at 65°C. In 
various embodiments, the substantially purified antibodies of the invention, or 

15 fragments thereof, can be human, non-human, chimeric and/or humanized antibodies. 

In another aspect, the invention provides human or non-human antibodies or 
fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of: the amino 
acid sequence of any one of SEQ IDN0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, or an 

20 amino acid sequence encoded by the cDNA of a clone deposited as ATCC® Accession 
Number 207220, 207223, PTA-248 or PTA-293; a fragment of at least 15 amino acid 
residues of the amino acid sequence of any one of SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 
23, or 91, an amino acid sequence which is at least 95% identical to the amino acid 
sequence of any one of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, wherein the 

25 percent identity is determined using the ALIGN program of the GCG software 
package with a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 or the BESTFIT program with a BLOSUM 62 scoring matrix, gap open 
penalty of 12, frame shift penalty of 5, gap extend penalty of 4; and an amino acid 
sequence which is encoded by a nucleic acid molecule which hybridizes to the nucleic 

30 acid molecule consisting of any one of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 
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18, 19, 22, 24, or 25, or the cDNA of a clone deposited as ATCC*' Accession Number 
207220, 207223, PTA-248 or PTA-293, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 
Such non-human antibodies can be goat, mouse, sheep, horse, chicken, rabbit, or rat 
5 antibodies. Alternatively, the non-human antibodies of the invention can be chimeric 
and/or humanized antibodies. In addition, the non-human antibodies of the invention 
can be polyclonal antibodies or monoclonal antibodies. 

In still a further aspect, the invention provides monoclonal antibodies or 
fragments thereof, which antibodies or fragments specifically bind to a polypeptide 

10 comprising an amino acid sequence selected from the group consisting of: the amino 
acid sequence of any one of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, or an 
amino acid sequence encoded by the cDNA of a clone deposited as ATCC® Accession 
Number 207220, 207223, PTA-248 or PTA-293; a fragment of at least 15 amino acid 
residues of the amino acid sequence of any one of SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 

15 23, or 91, an amino acid sequence which is at least 95% identical to the amino acid 
sequence of any one of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, wherein the 
percent identity is determined using the ALIGN program of the GCG software 
package with a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 or the BESTFIT program with a BLOSUM 62 scoring matrix, gap open 

20 penalty of 12, frame shift penalty of 5, gap extend penalty of 4; and an amino acid 
sequence which is encoded by a nucleic acid molecule which hybridizes to the nucleic 
acid molecule consisting of any one of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 
18, 19, 22, 24, or 25, or the cDNA of a clone deposited as any of ATCC® Accession 
Number 207220, 207223, PTA-248 or PTA-293, or a complement thereof, under 

25 conditions of hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0,1% SDS 
at 65°C. The monoclonal antibodies can be human, humanized, chimeric and/or non- 
human antibodies. 

The substantially purified antibodies or fragments thereof specifically bind to 
a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
30 cytoplasmic domain cytoplasmic membrane of a polypeptide of the invention. In a 
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particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof of the invention specifically bind to a secreted 
sequence or an extracellular domain of the amino acid sequence of SEQ ID N0s:2, 5, 
5 8, 1 1, 14, 17, 20, 23, or 91 . Preferably, the secreted sequence or extracellular domain 
to which the antibody, or fragment thereof binds comprises from about amino acids 
27 to 1 19 of SEQ ID N0:2, amino acid residues 1 to 305 of SEQ ID N0:5, amino 
acid residues 21 to 304 of SEQ ID NO: 14, amino acid residues 20 to 318 of SEQ ID 
N0:17, or amino acid residues 17 to 141 of SEQ ID NO:20. 
10 Any of the antibodies of the invention can be conjugated to a therapeutic 

moiety or to a detectable substance. Non-limiting examples of detectable substances 
that can be conjugated to the antibodies of the invention are an enzyme, a prosthetic 
group, a fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

15 The invention also provides a kit containing an antibody of the invention 

conjugated to a detectable substance, and instructions for use. Still another aspect of 
the invention is a pharmaceutical composition comprising an antibody of the 
invention and a pharmaceutically acceptable carrier. In preferred embodiments, the 
pharmaceutical composition contains an antibody of the invention, a therapeutic 

20 moiety, and a pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes TANGO 244, TANGO 246, TANGO 275, TANGO 300, and 
MANGO 245, the method comprising immunizing a mammal with a polypeptide. 
The polypeptide used as an immunogen comprises an amino acid sequence selected 

25 from the group consisting of: the amino acid sequence of any one of SEQ ID NOs:2, 
5, 8, 1 1, 14, 17, 20, 23, or 91, or an amino acid sequence encoded by the cDNA of a 
clone deposited as ATCC® Accession Number 207220, 207223, PTA-248 or PTA- 
293; a fragment of at least 15 amino acid residues of the amino acid sequence of any 
one of SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91 , an amino acid sequence which is 

30 at least 95% identical to the amino acid sequence of any one of SEQ ID NOs:2, 5, 8, 
11, 14, 17, 20, 23, or 91, wherein the percent identity is determined using the ALIGN 
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program of the GCG software package with a PAM120 weight residue table, a gap 
length penalty of 12, and a gap penalty of 4 or the BESTFIT program with a 
BLOSUM 62 scoring matrix, gap open penalty of 12, frame shift penalty of 5, gap 
extend penalty of 4; and an amino acid sequence which is encoded by a nucleic acid 
5 molecule which hybridizes to the nucleic acid molecule consisting of any one of SEQ 
IDNOs:l,3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, or 25, or the cDN A of a 
clone deposited as ATCC® Accession Number 207220, 207223, PTA-248 or PTA- 
293, or a complement thereof, under conditions of hybridization of 6X SSC at 45°C 
and washing in 0.2 X SSC, 0.1% SDS at 65°C. After immunization, a sample is 

10 collected from the mammal that contains an antibody that specifically recognizes 
GPVI. Preferably, the polypeptide is recombinantly produced using a non-human 
host cell Optionally, the antibodies can be ftirther purified fi-om the sample using 
techniques well known to those of skill in the art. The method can ftirther comprise 
producing a monoclonal antibody- producing cell from the cells of the mammal. 

1 5 Optionally, antibodies are collected fi'om the antibody-producing cell. 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding a polypeptide of the invention (or a 

20 portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the viral genome. Certain 

25 vectors are capable of autonomous replication in a host cell into which they are 

introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal 
mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, 

30 expression vectors, are capable of directing the expression of genes to which they are 
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operably linked. In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids (vectors). However, the invention is 
intended to include such other forms of expression vectors, such as viral vectors {e.g., 
replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
5 serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid 
of the invention in a form suitable for expression of the nucleic acid in a host cell. 
This means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 

10 operably linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence 
of interest is linked to the regulatory sequence(s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in vitro transcription/translation 
system or in a host cell when the vector is introduced into the host cell). The term 

15 "regulatory sequence" is intended to include promoters, enhancers and other 
expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Regulatory 
sequences include those which direct constitutive expression of a nucleotide sequence 

20 in many types of host cell and those which direct expression of the nucleotide 

sequence only in certain host cells {e.g., tissue-specific regulatory sequences). It will 
be appreciated by those skilled in the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to be transformed, the level of 
expression of protein desired, etc. The expression vectors of the invention can be 

25 introduced into host cells to thereby produce proteins or peptides, including fusion 
proteins or peptides, encoded by nucleic acids as described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic (e.g., E. coli) or eukaryotic 
cells (eg., insect cells (using baculovirus expression vectors), yeast cells or 

30 mammalian cells). Suitable host cells are discussed further in Goeddel, supra. 
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Alternatively, the recombinant expression vector can be transcribed and translated in 
vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of 
5 either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: 1) to increase expression of 
recombinant protein; 2) to increase the solubility of the recombinant protein; and 3) to 
aid in the purification of the recombinant protein by acting as a ligand in affinity 

10 purification. Often, in fusion expression vectors, a proteolytic cleavage site is 

introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase. Typical ftision expression 

15 vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988, Gene 
67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, 
Piscataway, NJ) which fiise glutathione S-transferase (GST), maltose E binding 
protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include 

20 pTrc (Amann et al, 1988, Gene 69:301-315) and pET 1 Id (Studier et al., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fiision promoter. Target gene 
expression from the pET lid vector relies on transcription from a T7 gnlO-lac fiision 

25 promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to 
30 express the protein in a host bacteria with an impaired capacity to proteolytically 

cleave the recombinant protein (Gottesman, Gene Expression Technology: Methods in 
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Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al., 1992, Nucleic Acids Res, 20:21 11-2118). 
5 Such alteration of nucleic acid sequences of the invention can be carried out by 
standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari et 
al., 1987, EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933- 

10 943), pJRY88 (Schultz et al., 1987, Gene 54:1 13-123), pYES2 (Invitrogen 
Corporation, San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
Sf 9 cells) include the pAc series (Smith et al, 1983, A/c?/. Cell Biol 3:2156-2165) 

15 and the pVL series (Lucklow and Summers, 1989, Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC 
(Kaufinan et al., 1987, EMBOJ. 6:187-195). When used in mammalian cells, the 

20 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra. 
In another embodiment, the recombinant mammalian expression vector is 

25 capable of directing expression of the nucleic acid preferentially in a particular cell 
type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). 
Tissue-specific regulatory elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin promoter (liver-specific; 
Pinkert et al., 1987, Genes Dev. 1:268-277), lymphoid-specific promoters (Caiame 

30 and Eaton, 1988, Adv. Immunol. 43:235-275), in particular promoters of T cell 

receptors (Winoto and Baltimore, 1989, EMBOJ. 8:729-733) and immunoglobulins 
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(Banerji et al., 1983, Cell 33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), 
neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 1989, 
Proc. Natl Acad. Sci. USA 86:5473-5477), pancreas-specific promoters (Edlund et al., 
1985, Science 230:912-916), and mammary gland-specific promoters {e.g., milk whey 
5 promoter; U.S. Patent No. 4,873,3 16 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, for example 
the murine hox promoters (Kessel and Gruss, 1990, Science 249:374-379) and the a- 
fetoprotein promoter (Campes and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention fiirther provides a recombinant expression vector comprising a 

10 DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operably linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to the mRNA encoding a polypepfide of the 
invention. Regulatory sequences operably linked to a nucleic acid cloned in the 

1 5 antisense orientation can be chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense RNA. The antisense expression 
vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in 

20 which antisense nucleic acids are produced under the control of a high efficiency 
regulatory region, the activity of which can be determined by the cell type into which 
the vector is introduced. For a discussion of the regulation of gene expression using 
antisense genes see Weintraub et al. (1986, Reviews - Trends in Genetics, Vol. 1). 

Another aspect of the invention pertains to host cells into which a recombinant 

25 expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may 

30 not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 
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A host cell can be any prokaryotic {e.g., E. coli) or eukaryotic cell {e.g., insect 
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
5 "transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. {supra), and other laboratory manuals, 

10 For stable transfection of mammalian cells, it is known that, depending upon 

the expression vector and transfection technique used, only a small fraction of cells 
may integrate the foreign DNA into their genome. In order to identify and select 
these integrants, a gene that encodes a selectable marker {e.g., for resistance to 
antibiotics) is generally introduced into the host cells along with the gene of interest. 

15 Preferred selectable markers include those which confer resistance to drugs, such as 
G418, hygromycin and methotrexate. Cells stably transfected with the introduced 
nucleic acid can be identified by drug selection {e.g., cells that have incorporated the 
selectable marker gene will survive, while the other cells die). 

In another embodiment, the expression characteristics of an endogenous {e.g., 

20 TANGO 244, TANGO 246, TANGO 275, TANGO 300, and MANGO 245) nucleic 
acid within a cell, cell line or microorganism may be modified by inserting a DNA 
regulatory element heterologous to the endogenous gene of interest into the genome 
of a cell, stable cell line or cloned microorganism such that the inserted regulatory 
element is operatively linked with the endogenous gene {e.g., TANGO 244, TANGO 

25 246, TANGO 275, TANGO 300, and MANGO 245) and controls, modulates or 
activates the endogenous gene. For example, endogenous TANGO 244, TANGO 
246, TANGO 275, TANGO 300, and MANGO 245 which are normally 
"transcriptionally silent", i.e„ TANGO 244, TANGO 246, TANGO 275, TANGO 
300, and MANGO 245 genes which are normally not expressed, or are expressed only 

30 al very low levels in a cell line or microorganism, may be activated by inserting a 
regulatory element which is capable of promoting the expression of a normally 
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expressed gene product in that cell line or microorganism. Alternatively, 
transcriptionally silent, endogenous TANGO 244, TANGO 246, TANGO 275, 
TANGO 300, and MANGO 245 genes may be activated by insertion of a 
promiscuous regulatory element that works across cell types. 
5 A heterologous regulatory element may be inserted into a stable cell line or 

cloned microorganism, such that it is operatively linked with and activates expression 
of endogenous TANGO 244, TANGO 246, TANGO 275, TANGO 300, and MANGO 
245 genes, using techniques, such as targeted homologous recombination, which are 
well known to those of skill in the art, and described e.g., in Chappel, U.S. Patent No. 

10 5,272,071; PCT publication No. WO 91/06667, published May 16, 1991. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide of the invention. Accordingly, the 
invention further provides methods for producing a polypeptide of the invention using 
the host cells of the invention. In one embodiment, the method comprises culturing 

15 the host cell of invention (into which a recombinant expression vector encoding a 
polypeptide of the invention has been introduced) in a suitable medium such that the 
polypeptide is produced. In another embodiment, the method further comprises 
isolating the polypeptide from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 

20 transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which a sequences encoding a 
polypeptide of the invention have been introduced. Such host cells can then be used 
to create non-human transgenic animals in which exogenous sequences encoding a 
polypeptide of the invention have been introduced into their genome or homologous 

25 recombinant animals in which endogenous encoding a polypeptide of the invention 
sequences have been altered. Such animals are useful for studying the function and/or 
activity of the polypeptide and for identifying and/or evaluating modulators of 
polypeptide activity. As used herein, a "transgenic animal" is a non-human animal, 
preferably a mammal, more preferably a rodent such as a rat or mouse, in which one 

30 or more of the cells of the animal includes a transgene. Other examples of transgenic 
animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, 
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etc. A transgene is exogenous DNA which is integrated into the genome of a cell 
from which a transgenic animal develops and which remains in the genome of the 
mature animal, thereby directing the expression of an encoded gene product in one or 
more cell types or tissues of the transgenic animal. As used herein, an "homologous 
5 recombinant animal'* is a non-human animal, preferably a manmial, more preferably a 
mouse, in which an endogenous gene has been altered by homologous recombination 
between the endogenous gene and an exogenous DNA molecule introduced into a cell 
of the animal, e.g,, an embryonic cell of the animal, prior to development of the 
animal. 

10 A transgenic animal of the invention can be created by introducing nucleic 

acid encoding a polypeptide of the invention (or a homologue thereof) into the male 
pronuclei of a fertihzed oocyte, e.g., by microinjection, retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female foster animal. Intronic 
sequences and polyadenylation signals can also be included in the transgene to 

15 increase the efficiency of expression of the transgene. A tissue-specific regulatory 
sequence(s) can be operably linked to the transgene to direct expression of the 
polypeptide of the invention to particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, particularly animals such as 
mice, have become conventional in the art and are described, for example, in U.S. 

20 Patent NOs. 4,736,866 and 4,870,009, U.S. Patent No. 4,873,191 and in Hogan, 

Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of 
the transgene in its genome and/or expression of mRNA encoding the transgene in 

25 tissues or cells of the animals. A transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, transgenic animals carrying the 
transgene can further be bred to other transgenic animals carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which 
contains at least a portion of a gene encoding a polypeptide of the invention into 

30 which a deletion, addition or substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the gene. In a preferred embodiment, the vector is designed such 

82 



wo 01/00672 



PCT/lJSO(»/18184 



that, upon homologous recombination, the endogenous gene is functionally disrupted 
(i.e., no longer encodes a functional protein; also referred to as a "knock out" vector). 
Alternatively, the vector can be designed such that, upon homologous recombination, 
the endogenous gene is mutated or otherwise altered but still encodes ftinctional 
5 protein (e.g., the upstream regulatory region can be ahered to thereby alter the 

expression of the endogenous protein). In the homologous recombination vector, the 
altered portion of the gene is flanked at its 5' and 3' ends by additional nucleic acid of 
the gene to allow for homologous recombination to occur between the exogenous 
gene carried by the vector and an endogenous gene in an embryonic stem cell. The 

10 additional flanking nucleic acid sequences are of sufficient length for successful 

homologous recombination with the endogenous gene. Typically, several kilobases of 
flanking DNA (both at the 5' and 3* ends) are included in the vector (see, e.g., Thomas 
and Capecchi, 1987, Cell 51 :503 for a description of homologous recombination 
vectors). The vector is introduced into an embryonic stem cell line {e.g., by 

15 electroporation) and cells in which the introduced gene has homologously recombined 
with the endogenous gene are selected (see, e.g., Li et al., 1992, Cell 69:915). The 
selected cells are then injected into a blastocyst of an animal (e.g.^ a mouse) to form 
aggregation chimeras (see, e.g., Bradley in Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, Robertson, ed. (IRL, Oxford, 1987) pgs. 1 13-152). A 

20 chimeric embryo can then be implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term. Progeny harboring the homologously 
recombined DNA in their germ cells can be used to breed animals in which all cells of 
the animal contain the homologously recombined DNA by germline transmission of 
the transgene. Methods for constructing homologous recombination vectors and 

25 homologous recombinant animals are described further in Bradley, 1991, Current 
Opinion in Bio/Technology 2:823-829 and in PCT Publication NOs. WO 90/1 1354, 
WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 

30 example of such a system is the cre/loxP recombinase system of bacteriophage PI . 
For a description of the cre/loxP recombinase system, see, e.g., Lakso et al., 1992, 
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Proc. Natl Acad. Sci. USA 89:6232-6236- Another example of a recombinase system 
is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et aL, 1 991 , 
Science 251:1351-1355. If a creZ/ox/' recombinase system is used to regulate 
expression of the transgene, animals containing transgenes encoding both the Cre 
5 recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be 
10 produced according to the methods described in Wilmut et al., 1997, Nature 385:810- 
813 and PCT Publication NOs. WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to 

15 herein as "active compounds") of the invention can be incorporated into 

pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" 
is intended to include any and all solvents, dispersion media, coatings, antibacterial 

20 and antifungal agents, isotonic and absorption delaying agents, and the like, 

compatible with pharmaceutical administration. The use of such media and agents for 
pharmaceutically active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary active compounds can also be 

25 incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions 
for modulating the expression or activity of a polypeptide or nucleic acid of the 
invention. Such methods comprise formulating a pharmaceutically acceptable carrier 
with an agent which modulates expression or activity of a polypeptide or nucleic acid 

30 of the invention. Such compositions can further include additional active agents. 
Thus, the invention further includes methods for preparing a pharmaceutical 
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composition by formulating a pharmaceutically acceptable carrier with an agent 
which modulates expression or activity of a polypeptide or nucleic acid of the 
invention and one or more additional active compounds. 

The agent which modulates expression or activity may, for example, be a 
5 small molecule. For example, such small molecules include peptides, 

peptidomimetics, amino acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or inorganic compounds (i.e., 
including heteroorganic and organometallic compounds) having a molecular weight 
less than about 10,000 grams per mole, organic or inorganic compounds having a 

10 molecular weight less than about 5,000 grams per mole, organic or inorganic 

compounds having a molecular weight less than about 1,000 grams per mole, organic 
or inorganic compounds having a molecular weight less than about 500 grams per 
mole, and salts, esters, and other pharmaceutically acceptable forms of such 
compounds. It is understood that appropriate doses of small molecule agents depends 

1 5 upon a number of factors within the ken of the ordinarily skilled physician, 

veterinarian, or researcher. The dose(s) of the small molecule w^ill vary, for example, 
depending upon the identity, size, and condition of the subject or sample being 
treated, further depending upon the route by which the composition is to be 
administered, if applicable, and the effect which the practitioner desires the small 

20 molecule to have upon the nucleic acid or polypeptide of the invention. Exemplary 
doses include milligram or microgram amounts of the small molecule per kilogram of 
subject or sample weight (e.g. about 1 microgram per kilogram to about 500 
milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per 

25 kilogram. It is furthermore understood that appropriate doses of a small molecule 
depend upon the potency of the small molecule with respect to the expression or 
activity to be modulated. Such appropriate doses may be determined using the assays 
described herein. When one or more of these small molecules is to be administered to 
an animal (e.g. a human) in order to modulate expression or activity of a polypeptide 

30 or nucleic acid of the invention, a physician, veterinarian, or researcher may, for 
example, prescribe a relatively low dose at first, subsequently increasing the dose 
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until an appropriate response is obtained. In addition, it is understood that the specific 
dose level for any particular animal subject will depend upon a variety of factors 
including the activity of the specific compound employed, the age, body weight, 
general health, gender, and diet of the subject, the time of administration, the route of 
5 administration, the rate of excretion, any drug combination, and the degree of 
expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g,^ inhalation), 

10 transdermal (topical), transmucosal, and rectal administration. Solutions or 

suspensions used for parenteral, intradermal, or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 
fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 

15 such as ascorbic acid or sodium bisulfite; chelating agents such as 

ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates and 
agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be 
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampules, disposable syringes or multiple 

20 dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, 

25 bacteriostatic water, Cremophor ELJ (BASF; Parsippany, NJ) or phosphate buffered 
saline (PBS). In all cases, the composition must be sterile and should be fluid to the 
extent that easy syringability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fiingi. The carrier can be a solvent or dispersion 

30 medium containing, for example, water, ethanol, polyol (for example, glycerol, 

propylene glycol, and liquid polyetheylene glycol, and the like), and suitable mixtures 
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thereof. The proper fluidity can be maintained, for example, by the use of a coating 
such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. Prevention of the action of microorganisms 
can be achieved by various antibacterial and antifungal agents, for example, parabens, 
5 chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it v^ill 
be preferable to include isotonic agents, for example, sugars, polyalcohols such as 
mannitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including in the composition an 
agent which delays absorption, for example, aluminum monostearate and gelatin. 

10 Sterile injectable solutions can be prepared by incorporating the active 

compound (e.g., a polypeptide or antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle which contains a basic 

15 dispersion medium and the required other ingredients from those enumerated above. 
In the case of sterile powders for the preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and fi-eeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

20 Oral compositions generally include an inert diluent or an edible carrier. They 

can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches, or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 

25 fluid carrier is applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches and the like 
can contain any of the following ingredients, or compounds of a similar nature: a 
binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such 

30 as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or com 
starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal 
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silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are deHvered in the form of 
an aerosol spray from a pressurized container or dispenser which contains a suitable 
5 propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 
10 salts, and fusidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories {e.g., with 
1 5 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
20 Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and 
Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to 
25 infected cells with monoclonal antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be prepared according to methods 
known to those skilled in the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in 
30 dosage unit form for ease of administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units suited as unitary dosages for the 
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subject to be treated; each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic effect in association with the 
required pharmaceutical carrier. The specification for the dosage unit forms of the 
invention are dictated by and directly dependent on the unique characteristics of the 
5 active compound and the particular therapeutic effect to be achieved^ and the 
limitations inherent in the art of compounding such an active compound for the 
treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 

10 50 mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies 
and fiilly human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 
enhance uptake and tissue penetration (e.g., into the brain). A method for lipidation 

1 5 of antibodies is described by Cruikshank et al. (1 997, J. Acquired Immune Deficiency 
Syndromes and Human Retrovirology 14:193). 

The nucleic acid molecules of the invention can be inserted into vectors and 
used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration (U.S. Patent 5,328,470) or by 

20 stereotactic injection (see, e.g., Chen et al, 1994, Proc. Natl Acad, Set. USA 91 :3054- 
3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g,, retroviral vectors, 

25 the pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 



30 
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V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies 
described herein can be used in one or more of the following methods: a) screening 
assays; b) detection assays {e.g,, chromosomal mapping, tissue typing, forensic 
5 biology); c) predictive medicine {e.g., diagnostic assays, prognostic assays, 

monitoring clinical trials, and pharmacogenomics); and d) methods of treatment (e.g., 
therapeutic and prophylactic). For example, the TANGO 244, TANGO 246, TANGO 
275, TANGO 300 and MANGO 245 polypeptides of the invention can to used to 
modulate cellular function, survival, morphology, proliferation, and/or differentiation 

10 of the cells in which they are expressed. The isolated nucleic acid molecules of the 
invention can be used to express proteins (e.g., via a recombinant expression vector in 
a host cell in gene therapy applications), to detect mRNA (e.g., in a biological sample) 
or a genetic lesion, and to modulate activity of a polypeptide of the invention. In 
addition, the polypeptides of the invention can be used to screen drugs or compounds 

15 which modulate activity or expression of a polypeptide of the invention as well as to 
treat disorders characterized by insufficient or excessive production of a protein of 
the invention or production of a form of a protein of the invention which has 
decreased or aberrant activity compared to the wild type protein. In addition, the 
antibodies of the invention can be used to detect and isolate a protein of the and 

20 modulate activity of a protein of the invention. 

This invention further pertains to novel agents identified by the above- 
described screening assays and uses thereof for treatments as described herein. 

A. Screening Assays 

25 The invention provides a method (also referred to herein as a "screening 

assay'*) for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules or other drugs) which bind to polypeptide 
of the invention or have a stimulatory or inhibitory effect on, for example, expression 
or activity of a polypeptide of the invention, 

30 In one embodiment, the invention provides assays for screening candidate or 

test compounds which bind to or modulate the activity of the membrane-bound form 
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of a polypeptide of the invention or biologically active portion thereof. The test 
compounds of the present invention can be obtained using any of the numerous 
approaches in combinatorial library methods known in the art, including: biological 
libraries; spatially addressable parallel solid phase or solution phase libraries; 
5 synthetic library methods requiring deconvolution; the "one-bead one-compound" 
library method; and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to peptide libraries, while the 
other four approaches are applicable to peptide, non-peptide oligomer or small 
molecule libraries of compounds (Lam, 1997, Anticancer Drug Des, 12:145). 

1 0 Examples of methods for the synthesis of molecular libraries can be found in 

the art, for example in: DeWitt et al, 1993, Proc, Natl. Acad. Sci. USA 90:6909; Erb 
et al., 1994, Proc. Natl. Acad. ScL USA 91:1 1422; Zuckermann et al., 1994,. J. Med. 
Chem. 37:2678; Cho et al., 1993, Science 261 :1303; Carrell et ah, 1994, Angew. 
Chem. Int. Ed. Engl. 33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 

15 33:2061; and Gallop etaL, 1994, J. Med Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992, 
Bio/Techniques 13:412-421), or on beads (Lam, 1991, A^a^wre 354:82-84), chips 
(Fodor, 1993, Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores 
(Patent NOs. 5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al., 1992, Proc. 

20 Natl. Acad Sci. USA 89:1865-1869) or phage (Scott and Smith, 1990, Science 
249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al., 1990, Proc. Natl. 
Acad Sci. USA 87:6378-6382; and Felici, 1991, J. MoL Biol. 222:301-310). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of a polypeptide of the invention, or a biologically 

25 active portion thereof, on the cell surface is contacted with a test compound and the 
ability of the test compound to bind to the polypeptide determined. The cell, for 
example, can be a yeast cell or a cell of mammalian origin. Determining the ability of 
the test compoimd to bind to the polypeptide can be accomplished, for example, by 
coupling the test compound with a radioisotope or enzymatic label such that binding 

30 of the test compound to the polypeptide or biologically active portion thereof can be 
determined by detecting the labeled compound in a complex. For example, test 
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compounds can be labeled with '^^I, ^'S, '^C, or ^H, either directly or indirectly, and 
the radioisotope detected by direct counting of radioemmission or by scintillation 
counting. Alternatively, test compounds can be enzymatically labeled with, for 
example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the 
5 enzymatic label detected by determination of conversion of an appropriate substrate to 
product. In a preferred embodiment, the assay comprises contacting a cell which 
expresses a membrane-bound form of a polypeptide of the invention, or a biologically 
active portion thereof, on the cell surface with a known compound which binds the 
polypeptide to form an assay mixture, contacting the assay mixture with a test 
10 compound, and determining the ability of the test compound to interact with the 

polypeptide, wherein determining the ability of the test compound to interact with the 
polypeptide comprises determining the ability of the test compound to preferentially 
bind to the polypeptide or a biologically active portion thereof as compared to the 
known compound. 

15 In another embodiment, the assay involves assessment of an activity 

characteristic of the polypeptide, wherein binding of the test compound with the 
polypeptide or a biologically active portion thereof alters (e.g., increases or decreases) 
the activity of the polypeptide. 

In another embodiment, an assay is a cell-based assay comprising contacting a 

20 cell expressing a membrane-bound form of a polypeptide of the invention, or a 
biologically active portion thereof, on the cell surface with a test compound and 
determining the ability of the test compound to modulate {e.g., stimulate or inhibit) 
the activity of the polypeptide or biologically active portion thereof. Determining the 
ability of the test compound to modulate the activity of the polypeptide or a 

25 biologically active portion thereof can be accomplished, for example, by determining 
the ability of the polypeptide protein to bind to or interact with a target molecule or to 
transport molecules across the cytoplasmic membrane. 

Determining the ability of a polypeptide of the invention to bind to or interact 
with a target molecule can be accomplished by one of the methods described above 

30 for determining direct binding. As used herein, a "target molecule" is a molecule with 
which a selected polypeptide (e.g., a polypeptide of the invention binds or interacts 
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with in nature, for example, a molecule on the surface of a cell which expresses the 
selected protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell 
membrane or a cytoplasmic molecule. A target molecule can be a polypeptide of the 
5 invention or some other polypeptide or protein. For example, a target molecule can 
be a component of a signal transduction pathway which facilitates transduction of an 
extracellular signal (e.g., a signal generated by binding of a compound to a 
polypeptide of the invention) through the cell membrane and into the cell or a second 
intercellular protein which has catalytic activity or a protein which facilitates the 

10 association of downstream signaling molecules with a polypeptide of the invention. 
Determining the ability of a polypeptide of the invention to bind to or interact with a 
target molecule can be accomplished by determining the activity of the target 
molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target {e.g., intracellular 

15 Ca^", diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on 
an appropriate substrate, detecting the induction of a reporter gene {e.g., a regulatory 
element that is responsive to a polypeptide of the invention operably linked to a 
nucleic acid encoding a detectable marker, e.g. luciferase), or detecting a cellular 
response, for example, cellular differentiation, or cell proliferation. 

20 In yet another embodiment, an assay of the present invention is a cell-free 

assay comprising contacting a polypeptide of the invention or biologically active 
portion thereof with a test compound and determining the ability of the test compound 
to bind to the polypeptide or biologically active portion thereof Binding of the test 
compound to the polypeptide can be determined either directly or indirectly as 

25 described above. In a preferred embodiment, the assay includes contacting the 
polypeptide of the invention or biologically active portion thereof with a known 
compound which binds the polypeptide to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to 
interact with the polypeptide, wherein determining the ability of the test compound to 

30 interact with the polypeptide comprises determining the ability of the test compound 
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to preferentially bind to the polypeptide or biologically active portion thereof as 
compared to the known compound. 

In another embodiment, an assay is a cell-free assay comprising contacting a 
polypeptide of the invention or biologically active portion thereof with a test 
5 compound and determining the ability of the test compound to modulate (e.g., 
stimulate or inhibit) the activity of the polypeptide or biologically active portion 
thereof. Determining the ability of the test compound to modulate the activity of the 
polypeptide can be accomplished, for example, by determining the ability of the 
polypeptide to bind to a target molecule by one of the methods described above for 

10 determining direct binding. In an altemative embodiment, determining the ability of 
the test compound to modulate the activity of the polypeptide can be accomplished by 
determining the ability of the polypeptide of the invention to further modulate the 
target molecule. For example, the catalytic/enzymatic activity of the target molecule 
on an appropriate substrate can be determined as previously described. 

15 In yet another embodiment, the cell-free assay comprises contacting a 

polypeptide of the invention or biologically active portion thereof with a known 
compound which binds the polypeptide to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the abihty of the test compound to 
interact with the polypeptide, wherein determining the ability of the test compound to 

20 interact with the polypeptide comprises determining the ability of the polypeptide to 
preferentially bind to or modulate the activity of a target molecule. 

The cell-free assays of the present invention are amenable to use of both a 
soluble form or the membrane-bound form of a polypeptide of the invention. In the 
case of cell-free assays comprising the membrane-bound form of the polypeptide, it 

25 may be desirable to utilize a solubilizing agent such that the membrane-bound form of 
the polypeptide is maintained in solution. Examples of such solubilizing agents 
include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n- 
octybnaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton 
X-100, Triton X-1 14, Thesit, Isotridecypoly(ethylene glycol ether)n, 3-[(3- 

30 cholamidopropyl)dimethylanaminio]-l -propane sulfonate (CHAPS), 3-[(3- 
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cholamidopropyl)dimethylarnminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or 
N-dodecyl=N,N-dimethy 1-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either the polypeptide of the invention or 
5 its target molecule to facilitate separation of complexed from uncomplexed forms of 
one or both of the proteins, as well as to accommodate automation of the assay. 
Binding of a test compound to the polypeptide, or interaction of the polypeptide with 
a target molecule in the presence and absence of a candidate compound, can be 
accomplished in any vessel suitable for containing the reactants. Examples of such 

1 0 vessels include microliter plates, test tubes, and micro-centrifuge tubes. In one 

embodiment, a fusion protein can be provided which adds a domain that allows one or 
both of the proteins to be bound to a matrix. For example, glutathione-S-transferase 
fusion proteins or glutathione-S-transferase fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical; St. Louis, MO) or glutathione 

1 5 derivatized microliter plates, which are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or A polypeptide of the 
invention, and the mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and pH). Following incubation, 
the beads or microtiter plate wells are washed to remove any unbound components 

20 and complex formation is measured either directly or indirectly, for example, as 
described above. Alternatively, the complexes can be dissociated from the matrix, 
and the level of binding or activity of the polypeptide of the invention can be 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 

25 screening assays of the invention. For example, either the polypeptide of the 

invention or its target molecule can be immobiUzed utilizing conjugation of biotin and 
streptavidin. Biotinylated polypeptide of the invention or target molecules can be 
prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known in 
the art (eg., biotinylation kit, Pierce Chemicals; Rockford, IL), and immobilized in 

30 the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 
antibodies reactive with the polypeptide of the invention or target molecules but 
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which do not interfere with binding of the polypeptide of the invention to its target 
molecule can be derivatized to the wells of the plate, and unbound target or 
polypeptide of the invention trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those described above for the GST- 
5 immobilized complexes, include immunodetection of complexes using antibodies 
reactive with the polypeptide of the invention or target molecule, as well as enzyme- 
linked assays which rely on detecting an enzymatic activity associated with the 
polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a polypeptide of the 

10 invention are identified in a method in which a cell is contacted with a candidate 
compound and the expression of the selected mRNA or protein (i.e., the mRNA or 
protein corresponding to a polypeptide or nucleic acid of the invention) in the cell is 
determined. The level of expression of the selected mRNA or protein in the presence 
of the candidate compound is compared to the level of expression of the selected 

1 5 mRNA or protein in the absence of the candidate compound. The candidate 

compound can then be identified as a modulator of expression of the polypeptide of 
the invention based on this comparison. For example, when expression of the 
selected mRNA or protein is greater (statistically significantly greater) in the presence 
of the candidate compound than in its absence, the candidate compound is identified 

20 as a stimulator of the selected mRNA or protein expression. Alternatively, when 
expression of the selected mRNA or protein is less (statistically significantly less) in 
the presence of the candidate compound than in its absence, the candidate compound 
is identified as an inhibitor of the selected mRNA or protein expression. The level of 
the selected mRNA or protein expression in the cells can be determined by methods 

25 described herein. 

In yet another aspect of the invention, a polypeptide of the inventions can be 
used as "bait proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al., 1993, Cell 72:223-232; Madura et al., 1993, J, 
Biol Chem, 268:12046-12054; Bartel et al, 1993, Bio/Techniques 14:920-924; 

30 Iwabuchi et al., 1993, Oncogene 8:1693-1696; and PCT PubUcation No. WO 

94/10300), to identify other proteins, which bind to or interact with the polypeptide of 
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the invention and modulate activity of the polypeptide of the invention. Such binding 
proteins are also likely to be involved in the propagation of signals by the polypeptide 
of the inventions as, for example, upstream or downstream elements of a signaling 
pathway involving the polypeptide of the invention. 
5 This invention further pertains to novel agents identified by the above- 

described screening assays and uses thereof for treatments as described herein. 

B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 

10 corresponding complete gene sequences) can be used in numerous ways as 

polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. These 

15 applications are described in the subsections below. 

1. Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, 
this sequence can be used to map the location of the gene on a chromosome. 

20 Accordingly, nucleic acid molecules described herein or fragments thereof can be 
used to map the location of the corresponding genes on a chromosome. The mapping 
of the sequences to chromosomes is an important first step in correlating these 
sequences with genes associated with disease. 

Briefly, genes can be mapped to chromosomes by preparing PGR primers 

25 (preferably 1 5 to 25 bp in length) from the sequence of a gene of the invention. 
Computer analysis of the sequence of a gene of the invention can be used to rapidly 
select primers that do not span more than one exon in the genomic DNA, thus 
complicating the amplification process. These primers can then be used for PCR 
screening of somatic cell hybrids containing individual human chromosomes. Only 

30 those hybrids containing the human gene corresponding to the gene sequences will 
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yield an amplified fragment. For a review of this technique, see D'Eustachio et al. 
Science 220:919-924). 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a , 
particular sequence to a particular chromosome. Three or more sequences can be 
5 assigned per day using a single thermal cycler. Using the nucleic acid sequences of 
the invention to design oligonucleotide primers, sublocalization can be achieved with 
panels of fragments from specific chromosomes. Other mapping strategies which can 
similarly be used to map a gene to its chromosome include in situ hybridization 
(described in Fan et al., 1990, Proc, Natl. Acad Sci USA 87:6223-27), pre-screening 

10 with labeled flow-sorted chromosomes (CITE), and pre-selection by hybridization to 
chromosome specific cDNA libraries. Fluorescence in situ hybridization (FISH) of a 
DNA sequence to a metaphase chromosomal spread can further be used to provide a 
precise chromosomal location in one step. For a review of this technique, see Verma 
et al., (Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New 

15 York, 1988), 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used 
for marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 

20 sequences are more likely to be conserved within gene families, thus increasing the 
chance of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance 

25 in Man, available on-line through Johns Hopkins University Welch Medical Library). 
The relationship between genes and disease, mapped to the same chromosomal 
region, can then be identified through linkage analysis (co-inheritance of physically 
adjacent genes), described in, e.g., Egeland et al., 1987, Nature 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 

30 unaffected with a disease associated with a gene of the invention can be determined. 
If a mutation is observed in some or all of the affected individuals but not in any 
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unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally 
involves first looking for structural alterations in the chromosomes such as deletions 
or translocations that are visible fi:om chromosome spreads or detectable using PCR 
5 based on that DNA sequence. Ultimately, complete sequencing of genes fi"om several 
individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 

Furthermore, the nucleic acid sequences disclosed herein can be used to 
perform searches against "mapping databases", e.g., BLAST-type search, such that the 

10 chromosome position of the gene is identified by sequence homology or identity with 
known sequence firagments which have been mapped to chromosomes, 

A polypeptide and fi*agments and sequences thereof and antibodies specific 
thereto can be used to map the location of the gene encoding the polypeptide on a 
chromosome. This mapping can be carried out by specifically detecting the presence 

1 5 of the polypeptide in members of a panel of somatic cell hybrids between cells of a 
first species of animal fi*om which the protein originates and cells fi*om a second 
species of animal and then determining which somatic cell hybrid(s) expresses the 
polypeptide and nofing the chromosome(s) fi-om the first species of animal that it 
contains. For examples of this technique, see Pajunen et al., 1988, Cytogenet. Cell 

20 Genet 47:37-41 and Van Keuren et al, 1986, Hum. Genet 74:34-40. Alternatively, 
the presence of the polypepfide in the somatic cell hybrids can be determined by 
assaying an activity or property of the polypeptide, for example, enzymatic activity, as 
described in Bordelon-Riser et al, 1979, Somatic Cell Genetics 5:597-613 and 
Owerbach et al., 1978. Proc. Natl. Acad. Sci. USA 75:5640-5644. 

25 

2. Tissue Tvping 

The nucleic acid sequences of the present invention can also be used to 
identify individuals fi'om minute biological samples. The United States military, for 
example, is considering the use of restriction fi-agment length polymorphisms (RFLP) 
30 for identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
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unique bands for identification. This method does not suffer from the current 
Hmitations of "Dog Tags" which can be lost, switched, or stolen, making positive 
identification difficult. The sequences of the present invention are useful as additional 
DNA markers for RFLP (described in U.S. Patent 5,272,057). 
5 Furthermore, the sequences of the present invention can be used to provide an 

alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the nucleic acid sequences 
described herein can be used to prepare two PCR primers from the 5' and 3' ends of 
the sequences. These primers can then be used to amplify an individual's DNA and 

10 subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 

1 5 and from tissue. The nucleic acid sequences of the invention uniquely represent 
portions of the human genome. Allelic variation occurs to some degree in the coding 
regions of these sequences, and to a greater degree in the noncoding regions. It is 
estimated that allelic variation between individual humans occurs with a frequency of 
about once per each 500 bases. Each of the sequences described herein can, to some 

20 degree, be used as a standard against which DNA from an individual can be compared 
for identification purposes. Because greater numbers of polymorphisms occur in the 
noncoding regions, fewer sequences are necessary to differentiate individuals. The 
noncoding sequences of SEQ ID NOs:l, 4, 7, 10, 13, 16, 19, 22 and 25 can 
comfortably provide positive individual identification with a panel of perhaps 10 to 

25 1 ,000 primers which each yield a noncoding amplified sequence of 1 00 bases. If 
predicted coding sequences, such as those in SEQ ID N0s:3, 6, 9, 12, 15, 18, 21 and 
24 are used, a more appropriate number of primers for positive individual 
identification would be 500-2,000. 

If a panel of reagents from the nucleic acid sequences described herein is used 

30 to generate a unique identification database for an individual, those same reagents can 
later be used to identify tissue from that individual. Using the unique identification 
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database, positive identification of the individual, living or dead, can be made from 
extremely small tissue samples. 

3. Use of Partial Gene Sequences in Forensic Biology 
5 DNA-based identification techniques can also be used in forensic biology. 

Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a 
perpetrator of a crime. To make such an identification, PGR technology can be used 
to amplify DNA sequences taken fi'om very small biological samples such as tissues, 

10 e.g., hair or skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. 
The amplified sequence can then be compared to a standard, thereby allowing 
identification of the origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PGR primers, targeted to specific loci in the human genome, which can 

15 enhance the reliability of DNA-based forensic identifications by, for example, 

providing another "identification marker" (i.e. another DNA sequence that is unique 
to a particular individual). As mentioned above, actual base sequence information can 
be used for identification as an accurate alternative to patterns formed by restriction 
enzyme generated fragments. Sequences targeted to noncoding regions are 

20 particularly appropriate for this use as greater numbers of polymorphisms occur in the 
noncoding regions, making it easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the nucleic acid sequences of the 
invention or portions thereof, e.g., fragments derived from noncoding regions having 
a length of at least 20 or 30 bases. 

25 The nucleic acid sequences described herein can further be used to provide 

polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
example, an in situ hybridization technique, to identify a specific tissue, e.g., brain 
tissue. This can be very useful in cases where a forensic pathologist is presented with 
a tissue of unknown origin. Panels of such probes can be used to identify tissue by 

30 species and/or by organ type. 
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C. Predictive Medicine: 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
5 Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 
245 protein and/or nucleic acid expression as well as TANGO 244, TANGO 246, 
TANGO 275, TANGO 300, or MANGO 245 activity, in the context of a biological 
sample {e.g., blood, serum, cells, tissue) to thereby determine whether an individual is 

10 afflicted with a disease or disorder, or is at risk of developing a disorder, associated 
with aberrant or unwanted TANGO 244, TANGO 246, TANGO 275, TANGO 300, or 
MANGO 245 expression or activity. The invention also provides for prognostic (or 
predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with TANGO 244, TANGO 246, TANGO 275, TANGO 300, or 

1 5 MANGO 245 protein, nucleic acid expression or activity. For example, mutations in 
a TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 245 gene can 
be assayed in a biological sample. Such assays can be used for prognostic or 
predictive purpose to thereby prophylactically treat an individual prior to the onset of 
a disorder characterized by or associated with TANGO 244, TANGO 246, TANGO 

20 275, TANGO 300, or MANGO 245 protein, nucleic acid expression or activity. 

As an alternative to making determinations based on the absolute 
expression level of selected genes, determinations may be based on the normalized 
expression levels of these genes. Expression levels are normalized by correcting the 
absolute expression level of a TANGO 244, TANGO 246, TANGO 275, TANGO 

25 300, or MANGO 245 gene by comparing its expression to the expression of a gene 
that is not a TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 245 
gene, e.g.^ a housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene. This normalization 
allows the comparison of the expression level in one sample, e.g,, a patient sample, to 

30 another sample, e.g., a non-disease sample, or between samples from different 
sources. 
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Alternatively, the expression level can be provided as a relative expression 
level. To determine a relative expression level of a gene, the level of expression of 
the gene is determined for 10 or more samples of different cell isolates, preferably 50 
or more samples, prior to the determination of the expression level for the sample in 
5 question. The mean expression level of each of the genes assayed in the larger 

number of samples is determined and this is used as a baseline expression level for the 
gene(s) in question. The expression level of the gene determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained 
for that gene. This provides a relative expression level and aids in identifying extreme 

10 cases of disease. 

Preferably, the samples used in the baseline determination will be from 
diseased or from non-diseased cells. The choice of the cell source is dependent on the 
use of the relative expression level. Using expression found in normal tissues as a 
mean expression score aids in validating whether the TANGO 244, TANGO 246, 

15 TANGO 275, TANGO 300, or MANGO 245 gene assayed is cell-type specific 
(versus normal cells). Such a use is particularly important in identifying whether a 
TANGO 244, TANGO 246, TANGO 275, TANGO 300, or MANGO 245 gene can 
serve as a target gene. In addition, as more data is accumulated, the mean expression 
value can be revised, providing improved relative expression values based on 

20 accumulated data. Expression data from cells provides a means for grading the 
severity of the disease state. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of TANGO 244, TANGO 246, 
TANGO 275, TANGO 300, or MANGO 245 in cUnical trials. 

25 These and other agents are described in ftirther detail in the following sections. 

1. Dia j gnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide 
or nucleic acid of the invention in a biological sample involves obtaining a biological 
30 sample from a test subject and contacting the biological sample with a compound or 
an agent capable of detecting a polypeptide or nucleic acid (e.g., mRNA, genomic 
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DNA) of the invention such that the presence of a polypeptide or nucleic acid of the 
invention is detected in the biological sample. A preferred agent for detecting nxRNA 
or genomic DNA encoding a polypeptide of the invention is a labeled nucleic acid 
probe capable of hybridizing to mRNA or genomic DNA encoding a polypeptide of 
5 the invention. The nucleic acid probe can be, for example, a full-length cDNA, such 
as the nucleic acid of SEQ ID NOs:l, 3, 4, 6, 7, 9,10, 12, 13, 15, 16, 18, 19, 22, 24 
and 25, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 
or 500 nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to a mRNA or genomic DNA encoding a polypeptide of the invention. 
10 Other suitable probes for use in the diagnostic assays of the invention are described 
herein. 

A preferred agent for detecting a polypeptide of the invention is an antibody 
capable of binding to a polypeptide of the invention, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 

15 intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
"labeled", with regard to the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of 

20 indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently labeled streptavidin. The term "biological sample" is 
intended to include tissues, cells and biological fluids isolated from a subject, as well 
as tissues, cells and fluids present within a subject. That is, the detection method of 

25 the invention can be used to detect mRNA, protein, or genomic DNA in a biological 
sample in vitro as well as in vivo. For example, in vitro techniques for detection of 
mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques 
for detection of a polypeptide of the invention include enzyme linked immunosorbent 
assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. In 

30 vitro techniques for detection of genomic DNA include Southern hybridizations. 

Furthermore, in vivo techniques for detection of a polypeptide of the invention include 
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introducing into a subject a labeled antibody directed against the polypeptide. For 
example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
5 test subject. Alternatively, the biological sample can contain mRNA molecules from 
the test subject or genomic DNA molecules from the test subject. A preferred 
biological sample is a peripheral blood leukocyte sample isolated by conventional 
means from a subject. 

In another embodiment, the methods further involve obtaining a control 

10 biological sample from a control subject, contacting the control sample with a 

compound or agent capable of detecting a polypeptide of the invention or mRNA or 
genomic DNA encoding a polypeptide of the invention, such that the presence of the 
polypeptide or mRNA or genomic DNA encoding the polypeptide is detected in the 
biological sample, and comparing the presence of the polypeptide or mRNA or 

1 5 genomic DNA encoding the polypeptide in the control sample with the presence of 
the polypeptide or mRNA or genomic DNA encoding the polypeptide in the test 
sample. 

The invention also encompasses kits for detecting the presence of a 
polypeptide or nucleic acid of the invention in a biological sample (a test sample). 

20 Such kits can be used to determine if a subject is suffering from or is at increased risk 
of developing a disorder associated with aberrant expression of a polypeptide of the 
invention (e.g., a proliferative disorder, e.g., psoriasis or cancer). For example, the kit 
can comprise a labeled compound or agent capable of detecting the polypeptide or 
mRNA encoding the polypeptide in a biological sample and means for determining 

25 the amount of the polypeptide or mRNA in the sample (e.g,, an antibody which binds 
the polypeptide or an oligonucleotide probe which binds to DNA or mRNA encoding 
the polypeptide). Kits can also include instructions for observing that the tested 
subject is suffering from or is at risk of developing a disorder associated with aberrant 
expression of the polypeptide if the amount of the polypeptide or mRNA encoding the 

30 polypeptide is above or below a normal level. 
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For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide of the invention; and, 
optionally, (2) a second, different antibody which binds to either the polypeptide or 
the first antibody and is conjugated to a detectable agent. 
5 For oligonucleotide-based kits, the kit can comprise, for example: (1) an 

oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a 
nucleic acid sequence encoding a polypeptide of the invention or (2) a pair of primers 
useful for amplifying a nucleic acid molecule encoding a polypeptide of the invention. 
The kit can also comprise, e.g., a buffering agent, a preservative, or a protein 

10 stabilizing agent. The kit can also comprise components necessary for detecting the 
detectable agent (e.g., an enzyme or a substrate). The kit can also contain a control 
sample or a series of control samples which can be assayed and compared to the test 
sample contained. Each component of the kit is usually enclosed within an individual 
container and all of the various containers are within a single package along with 

15 instructions for observing whether the tested subject is suffering fi-om or is at risk of 
developing a disorder associated with aberrant expression of the polypeptide. 

2. Prognostic Assays 

The methods described herein can furthermore be utilized as diagnostic or 

20 prognostic assays to identify subjects having or at risk of developing a disease or 
disorder associated with aberrant expression or activity of a polypeptide of the 
invention. For example, the assays described herein, such as the preceding diagnostic 
assays or the following assays, can be utilized to identify a subject having or at risk of 
developing a disorder associated with aberrant expression or activity of a polypeptide 

25 of the invention. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing such a disease or disorder. Thus, the present 
invention provides a method in which a test sample is obtained from a subject and a 
polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of the invention is detected, 
wherein the presence of the polypeptide or nucleic acid is diagnostic for a subject 

30 having or at risk of developing a disease or disorder associated with aberrant 

expression or activity of the polypeptide. As used herein, a "test sample" refers to a 
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biological sample obtained from a subject of interest. For example, a test sample can 
be a biological fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
5 peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 

candidate) to treat a disease or disorder associated with aberrant expression or activity 
of a polypeptide of the invention. For example, such methods can be used to 
determine whether a subject can be effectively treated with a specific agent or class of 
agents {e.g., agents of a type which decrease activity of the polypeptide). Thus, the 

10 present invention provides methods for determining whether a subject can be 

effectively treated with an agent for a disorder associated with aberrant expression or 
activity of a polypeptide of the invention in which a test sample is obtained and the 
polypeptide or nucleic acid encoding the polypeptide is detected (e.g., wherein the 
presence of the polypeptide or nucleic acid is diagnostic for a subject that can be 

1 5 administered the agent to treat a disorder associated with aberrant expression or 
activity of the polypeptide). 

The methods of the invention can also be used to detect genetic lesions or 
mutations in a gene of the invention, thereby determining if a subject with the 
lesioned gene is at risk for a disorder characterized aberrant expression or activity of a 

20 polypeptide of the invention. In preferred embodiments, the methods include 

detecting, in a sample of cells from the subject, the presence or absence of a genetic 
lesion or mutation characterized by at least one of an alteration affecting the integrity 
of a gene encoding the polypeptide of the invention, or the mis-expression of the gene 
encoding the polypeptide of the invention. For example, such genetic lesions or 

25 mutations can be detected by ascertaining the existence of at least one of: 1 ) a 
deletion of one or more nucleotides from the gene; 2) an addition of one or more 
nucleotides to the gene; 3) a substitution of one or more nucleotides of the gene; 4) a 
chromosomal rearrangement of the gene; 5) an alteration in the level of a messenger 
RNA transcript of the gene; 6) an aberrant modification of the gene, such as of the 

30 methylation pattern of the genomic DNA; 7) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of the gene; 8) a non-wild type level of a the 
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protein encoded by the gene; 9) an allelic loss of the gene; and 10) an inappropriate 
post-translational modification of the protein encoded by the gene. As described 
herein, there are a large number of assay techniques known in the art which can be 
used for detecting lesions in a gene. 
5 In certain embodiments, detection of the lesion involves the use of a 

probe/primer in a polymerase chain reaction (PGR) {see, e.g., U.S. Patent NOs. 
4,683,195 and 4,683,202), such as anchor PGR or RAGE PGR, or, alternatively, in a 
ligation chain reaction (LGR) {see, e.g., Landegran et al., 1988, Science 241 :1077- 
1080; and Nakazawa et al., 1994, Proc, Natl Acad. Sci. USA 91:360-364), the latter 

10 of which can be particularly useful for detecting point mutations in a gene {see, e.g., 
Abravaya et al., 1995, Nucleic Acids Res. 23:675-682). This method can include the 
steps of collecting a sample of cells from a patient, isolating nucleic acid {e.g., 
genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to the selected gene 

15 under conditions such that hybridization and amplification of the gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting 
the size of the amplification product and comparing the length to a control sample. It 
is anticipated that PGR and/or LGR may be desirable to use as a preliminary 
amplification step in conjunction with any of the techniques used for detecting 

20 mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al, 1990, Proc. Natl. Acad Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et aL,1989, Proc. Natl. Acad. Sci. USA 86:1 173-1 177), 
Q-Beta Replicase (Lizardi et al, 1988, Bio/Technology 6: 1 197), or any other nucleic 

25 acid amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are 
especially useful for the detection of nucleic acid molecules if such molecules are 
present in very low numbers. 

In an alternative embodiment, mutations in a selected gene from a sample cell 

30 can be identified by alterations in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified (optionally), digested with one or more 
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restriction endonucleases, and fragment length sizes are determined by gel 
electrophoresis and compared. Differences in fragment length sizes between sample 
and control DNA indicates mutations in the sample DNA. Moreover, the use of 
sequence specific ribozymes {see, e.g., U.S. Patent No. 5,498,531) can be used to 
5 score for the presence of specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high density arrays 
containing hundreds or thousands of oligonucleotides probes (Cronin et al., 1996, 

10 Human Mutation 7:244-255; Kozal et al., 1996, Nature Medicine 2:753-759). For 
example, genetic mutations can be identified in two-dimensional arrays containing 
light-generated DNA probes as described in Cronin et al, supra. Briefly, a first 
hybridization array of probes can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the sequences by making linear 

15 arrays of sequential overlapping probes. This step allows the identification of point 
mutations. This step is followed by a second hybridizafion array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the wild-type gene and the 

20 other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the selected gene and detect mutations by 
comparing the sequence of the sample nucleic acids with the corresponding wild-type 
(control) sequence. Examples of sequencing reactions include those based on 

25 techniques developed by Maxim and Gilbert (1977, Proc, Natl, Acad. Sci. USA 
74:560) or Sanger (1977, Proc. Natl. Acad Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 
when performing the diagnostic assays (1995, Bio/Techniques 19:448), including 
sequencing by mass spectrometry (see, e.g., PCT Publication No. WO 94/16101; 

30 Cohen et al., 1996, Adv. Chromatogr. 36:127-162; and Griffin et al., 1993, Appl. 
Biochem. Biotechnol 38:147-159). 
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Other methods for detecting mutations in a selected gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al., 1985, Science 230: 1242). In 
general, the technique of *'mismatch cleavage" entails providing heteroduplexes 
5 formed by hybridizing (labeled) RNA or DNA containing the wild-type sequence 
with potentially mutant RNA or DNA obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which cleaves single-stranded regions of 
the duplex such as which will exist due to basepair mismatches between the control 
and sample strands. RNA/DNA duplexes can be treated with RNase to digest 

10 mismatched regions, and DNA/DNA hybrids can be treated with SI nuclease to digest 
mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA 
duplexes can be treated with hydroxylamine or osmium tetroxide and with piperidine 
in order to digest mismatched regions. After digestion of the mismatched regions, the 
resulting material is then separated by size on denaturing polyacrylamide gels to 

15 determine the site of mutation. See, e.g.. Cotton et al., 1988, Proc. Natl. Acad. Sci. 
USA 85:4397; Saleeba et al., 1992, Methods EnzymoL 217:286-295. In a preferred 
embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so 

20 called **DNA mismatch repair" enzymes) in defined systems for detecting and 

mapping point mutations in cDNAs obtained from samples of cells. For example, the 
mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al., 1994, 
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe 

25 based on a selected sequence, e.g., a wild-type sequence, is hybridized to a cDNA or 
other DNA product from a test cell(s). The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be detected from electrophoresis 
protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 

30 identify mutations in genes. For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in electrophoretic mobility between mutant 
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and wild type nucleic acids (Orita et aL, 1989, Proa Nati Acad. ScL USA 86:2766; 
see also Cotton, 1993, MutaL Res, 285:125-144; Hayashi, 1992, Genet. Anal. Tech. 
AppL 9:73-79). Single-stranded DNA fragments of sample and control nucleic acids 
will be denatured and allowed to renature. The secondary structure of single-stranded 
5 nucleic acids varies according to sequence, and the resulting alteration in 

electrophoretic mobility enables the detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary 
structure is more sensitive to a change in sequence. In a preferred embodiment, the 

10 subject method utilizes heteroduplex analysis to separate double stranded 

heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et 
aL, 1991, Trends Genet. 7:5). 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 

15 gradient gel electrophoresis (DGGE) (Myers et al., 1985, Nature 3U'A9S). When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does 
not completely denature, for example by adding a 'GC clamp of approximately 40 bp 
of high-melting GC-rich DNA by PGR. In a finther embodiment, a temperature 
gradient is used in place of a denaturing gradient to identify differences in the 

20 mobility of control and sample DNA (Rosenbaum and Reissner, 1987, Biophys. 
Chem. 265:12753). 

Examples of other techniques for detecting point mutations include, but are 
not limited to, selective oligonucleotide hybridization, selective amplification, or 
selective primer extension. For example, oligonucleotide primers may be prepared in 

25 which the known mutation is placed centrally and then hybridized to target DNA 
under conditions which permit hybridization only if a perfect match is found (Saiki et 
al., 1986, Nature 324:163; Saiki et al., 1989, Proc. Natl Acad. ScL USA 86:6230). 
Such allele specific oligonucleotides are hybridized to PGR amplified target DNA or 
a number of different mutations when the oligonucleotides are attached to the 

30 hybridizing membrane and hybridized with labeled target DNA. 
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Alternatively, allele specific amplification technology which depends on 
selective PGR amplification may be used in conjuncfion with the instant invention. 
Ohgonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
5 hybridization; Gibbs et al., 1989, Nucleic Acids Res. 17:2437-2448) or at the extreme 
3' end of one primer where, under appropriate conditions, mismatch can prevent or 
reduce polymerase extension (Prossner, 1993, Tibtech 1 1 :238). In addition, it may be 
desirable to introduce a novel restriction site in the region of the mutation to create 
cleavage-based detection (Gasparini et al, 1992, Moi Cell Probes 6:1), It is 

10 anticipated that in certain embodiments amplification may also be performed using 
Taq ligase for amplification (Barany, 1991, Proc, Natl Acad. ScL USA 88:189). In 
such cases, ligation will occur only if there is a perfect match at the 3' end of the 5' 
sequence making it possible to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of amplification. 

15 The methods described herein may be performed, for example, by utilizing 

pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, e,g. , in clinical settings to 
diagnose patients exhibiting symptoms or family history of a disease or illness 
involving a gene encoding a polypeptide of the invention. Furthermore, any cell type 

20 or tissue, preferably peripheral blood leukocytes, in which the polypeptide of the 
invention is expressed may be utilized in the prognostic assays described herein. 

3. Pharmacojgenomics 

Agents, or modulators which have a stimulatory or inhibitory effect on activity 

25 or expression of a polypeptide of the invention as identified by a screening assay 
described herein can be administered to individuals to treat (prophylactically or 
therapeutically) disorders associated with aberrant activity of the polypeptide. In 
conjunction with such treatment, the pharmacogenomics (i.e., the study of the 
relationship between an individual's genotype and that individual's response to a 

30 foreign compound or drug) of the individual may be considered. Differences in 
metabolism of therapeutics can lead to severe toxicity or therapeutic failure by 
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altering the relation between dose and blood concentration of the pharmacologically 
active drug. Thus, the pharmacogenomics of the individual permits the selection of 
effective agents (e.g.y drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such pharmacogenomics can further be 
5 used to determine appropriate dosages and therapeutic regimens. Accordingly, the 
activity of a polypeptide of the invention, expression of a nucleic acid of the 
invention, or mutation content of a gene of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

10 Pharmacogenomics deals with clinically significant hereditary variations in 

the response to drugs due to ahered drug disposition and abnormal action in affected 
persons. See, e.g., Linder, 1997, Clin. Chem. 43(2):254-266. In general, two types of 
pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body are referred to as "altered drug 

15 action," Genetic conditions transmitted as single factors altering the way the body 
acts on drugs are referred to as "altered drug metabolism". These pharmacogenetic 
conditions can occur either as rare defects or as polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 

20 oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption 
of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 

25 (NAT 2) and cytochrome P450 enzymes C YP2D6 and C YP2C 1 9) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
of a drug. These polymorphisms are expressed in two phenotypes in the population, 
the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 

30 different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead 
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to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, a PM will 
show no therapeutic response, as demonstrated for the analgesic effect of codeine 
5 mediated by its CYP2D6-fonned metabolite morphine. The other extreme are the so 
called ultra-rapid metabolizers who do not respond to standard doses. Recently, the 
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 
gene amplification. 

Thus, the activity of a polypeptide of the invention, expression of a nucleic 

10 acid encoding the polypeptide, or mutation content of a gene encoding the 

polypeptide in an individual can be determined to thereby select appropriate agent(s) 
for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 
encoding drug-metabolizing enzymes to the identification of an individual's drug 

15 responsiveness phenotype. This knowledge, when applied to dosing or drug 
selection, can avoid adverse reactions or therapeutic failure and thus enhance 
therapeutic or prophylactic efficiency when treating a subject with a modulator of 
activity or expression of the polypeptide, such as a modulator identified by one of the 
exemplary screening assays described herein. 

20 

4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents {e.g., drugs, compounds) on the expression 
or activity of a polypeptide of the invention (e.g., the ability to modulate aberrant cell 
proliferation chemotaxis, and/or differentiation) can be appUed not only in basic drug 

25 screening, but also in clinical trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to increase gene expression, 
protein levels or protein activity, can be monitored in clinical trials of subjects 
exhibiting decreased gene expression, protein levels, or protein activity. 
Alternatively, the effectiveness of an agent, as determined by a screening assay, to 

30 decrease gene expression, protein levels or protein activity, can be monitored in 
clinical trials of subjects exhibiting increased gene expression, protein levels, or 
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protein activity. In such clinical trials, expression or activity of a polypeptide of the 
invention and preferably, that of other polypeptide that have been implicated in for 
example, a cellular proliferation disorder, can be used as a marker of the immune 
responsiveness of a particular cell. 
5 For example, and not by way of limitation, genes, including those of the 

invention, that are modulated in cells by treatment with an agent (e.g., compound, 
drug or small molecule) which modulates activity or expression of a polypeptide of 
the invention (e.g., as identified in a screening assay described herein) can be 
identified. Thus, to study the effect of agents on cellular proliferation disorders, for 

10 example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for 
the levels of expression of a gene of the invention and other genes implicated in the 
disorder. The levels of gene expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively 
by measuring the amount of protein produced, by one of the methods as described 

15 herein, or by measuring the levels of activity of a gene of the invention or other genes. 
In this way, the gene expression pattern can serve as a marker, indicative of the 
physiological response of the cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, treatment of the individual with 
the agent. 

20 In a preferred embodiment, the present invention provides a method for 

monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other 
drug candidate identified by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a subject prior to 

25 administration of the agent; (ii) detecting the level of the polypeptide or nucleic acid 
of the invention in the preadministration sample; (iii) obtaining one or more post- 
administration samples from the subject; (iv) detecting the level the of the polypeptide 
or nucleic acid of the invention in the post-administration samples; (v) comparing the 
level of the polypeptide or nucleic acid of the invention in the pre-administration 

30 sample with the level of the polypeptide or nucleic acid of the invention in the post- 
administration sample or samples; and (vi) altering the administration of the agent to 
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the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of the polypeptide to higher levels than 
detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease expression or activity of the 
5 polypeptide to lower levels than detected, i.e., to decrease the effectiveness of the 
agent. 

C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 

10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 

associated with aberrant expression or activity of a polypeptide of the invention, e.g., 
cardiac infection (e.g., myocarditis or dilated cardiomyopathy), central nervous 
system infection (eg., non-specific febrile illness or meningoencephalitis), pancreatic 
infection (e.g., acute pancreatitis), respiratory infection (pneumonia), gastrointestinal 

15 infection, type I diabetes, cancer, familia hypercholesterolemia, treat hemophilia B, 
Marfan syndrome, protein S deficiency, allergy, inflammation, and gastroduodenal 
ulcer. Moreover, the polypeptides of the invention can be used to modulate cellular 
function, survival, morphology, proliferation and/or differentiation. 

20 1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant expression or activity of a 
polypeptide of the invention, by administering to the subject an agent which 
modulates expression or at least one activity of the polypeptide. Subjects at risk for a 

25 disease which is caused or contributed to by aberrant expression or activity of a 
polypeptide of the invention can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. Administration of a 
prophylactic agent can occur prior to the manifestation of symptoms characteristic of 
the aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in 

30 its progression. Depending on the type of aberrancy, for example, an agonist or 
antagonist agent can be used for treating the subject. 
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2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression 
or activity of a polypeptide of the invention for therapeutic purposes. The modulatory 
method of the invention involves contacting a cell with an agent that modulates one or 
5 more of the activities of the polypeptide. An agent that modulates activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally-occurring 
cognate ligand of the polypeptide, a peptide, a peptidomimetic, or other small 
molecule. In one embodiment, the agent stimulates one or more of the biological 
activities of the polypeptide. Examples of such stimulatory agents include the active 

10 polypeptide of the invention and a nucleic acid molecule encoding the polypeptide of 
the invention that has been introduced into the cell. In another embodiment, the agent 
inhibits one or more of the biological activities of the polypeptide of the invention. 
Examples of such inhibitory agents include antisense nucleic acid molecules and 
antibodies. These modulatory methods can be performed in vitro (e.g., by culturing 

15 the cell with the agent) or, alternatively, in vivo (e.g, by administering the agent to a 
subject). As such, the present invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of 
a polypeptide of the invention. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described 

20 herein), or combination of agents that modulates {e.g., upregulates or downregulates) 
expression or activity. In another embodiment, the method involves administering a 
polypeptide of the invention or a nucleic acid molecule of the invention as therapy to 
compensate for reduced or aberrant expression or activity of the polypeptide. 

Stimulation of activity is desirable in situations in which activity or expression 

25 is abnormally low or dovmregulated and/or in which increased activity is likely to 
have a beneficial effect. Conversely, inhibition of activity is desirable in situations in 
which activity or expression is abnormally high or upregulated and/or in which 
decreased activity is likely to have a beneficial effect. 

The contents of all references, patents and published patent applications cited 

30 throughout this application are hereby incorporated by reference. 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
5 following Claims. 
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What is claimed is : 

1 . An isolated nucleic acid molecule selected from the group consisting 

of: 

5 a) a nucleic acid molecule comprising a nucleotide sequence which is at 

least 55% identical to the nucleotide sequence of SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 
13, 15, 16, 18, 19, 22, 24, 25, the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number 207220, the cDNA insert of the plasmid deposited with 
the ATCC® as Accession Number 207223, the cDNA insert of the plasmid deposited 
1 0 with the ATCC"^ as Accession Number PTA-248, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-293, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 
nucleotides of the nucleotide sequence of SEQ ID NOs: 1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 
16, 18, 19, 22, 24 or 25, the cDNA insert of the plasmid deposited with the ATCC® as 

15 Accession Number 207220, the cDNA insert of the plasmid deposited with the 

ATCC® as Accession Number 207223, the cDNA insert of the plasmid deposited with 
the ATCC® as Accession Number PTA-248, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-293, or a complement thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising the 
20 amino acid sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid 

sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207220, the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207223, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
25 Accession Number PTA-248, or the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number PTA-293; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, 
the amino acid sequence encoded by the cDNA insert of the plasmid deposited with 

30 the ATCC® as Accession Number 207220, the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207223, 
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the amino acid sequence encoded by the cDNA insert of the plasmid deposited with 
the ATCC® as Accession Number PTA-248, or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 
PTA-293, wherein the fragment comprises at least 15 contiguous amino acids of SEQ 
5 ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207220, 
the amino acid sequence encoded by the cDNA insert of the plasmid deposited with 
the ATCC*^ as Accession Number 207223, the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number PTA- 

10 248, or the amino acid sequence encoded by the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number PTA-293; and 

e) a nucleic acid molecule which encodes a naturally occurring allelic 
variant of a polypeptide comprising the amino acid sequence of SEQ ID N0s:2, 5, 8, 
11,14, 17, 20, 23, or 91, the amino acid sequence encoded by the cDNA insert of the 

1 5 plasmid deposited with the ATCC® as Accession Number 207220, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207223, the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number PTA-248, or the amino 
acid sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® 

20 as Accession Number PTA-293, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule comprising SEQ EDNOs: 1, 3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 
19, 22, 24, 25, or a complement thereof, under stringent conditions. 

1. The isolated nucleic acid molecule of Claim 1, which is selected from 
25 the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID NOs: 1 , 
3,4, 6, 7, 9,10, 12, 13, 15, 16, 18, 19, 22, 24 or 25, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207220, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207223, the cDNA insert of 
30 the plasmid deposited with the ATCC® as Accession Number PTA-248, the cDNA 
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insert of the plasmid deposited with the ATCC® as Accession Number PTA-293, or a 
complement thereof; and 

b) a nucleic acid molecule which encodes a polypeptide comprising the 
amino acid sequence of SEQ IDN0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid 
5 sequence encoded by the cDNA insert of the plasmid deposited with the ATCC* as 
Accession Number 207220, the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207223, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number PTA-248, or the amino acid sequence encoded by the cDNA 
10 insert of the plasmid deposited with the ATCC® as Accession Number PTA-293. 

3. The nucleic acid molecule of Claim 1 fiuther comprising vector 
nucleic acid sequences. 

15 4. The nucleic acid molecule of Claim 1 further comprising nucleic acid 

sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of Claim 1 . 

20 6. The host cell of Claim 5 which is a mammalian host cell. 

7. A non-human mammalian host cell containing the nucleic acid 
molecule of Claim 1 . 

25 8. An isolated polypeptide selected from the group consisting of 

a) a fragment of a polypeptide comprising the amino acid sequence of 
SEQ ID N0s:2, 5, 8, 1 1 , 14, 1 7, 20, 23, or 91, wherein the fragment comprises at least 
15 contiguous amino acids of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91; 

b) a naturally occurring allelic variant of a polypeptide comprising the 
30 aminoacidsequenceofSEQIDNOs:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid 

sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
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Accession Number 207220, the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207223, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number PTA-248, or the amino acid sequence encoded by the cDNA 
5 insert of the plasmid deposited with the ATCC® as Accession Number PTA-293, 
wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID NOs: 1, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 
19, 22, 24, 25, or a complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising 
10 a nucleotide sequence which is at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NOs: 1,3,4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 19, 22, 24, 
25, or a complement thereof 

9. The isolated polypeptide of Claim 8 comprising the amino acid 
15 sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91. 

10. The polypeptide of Claim 8 further comprising heterologous amino 
acid sequences. 

20 11. An antibody which selectively binds to a polypeptide of Claim 8. 

12. A method for producing a polypeptide selected from the group 
consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NOs:2, 
25 5, 8, 1 1, 14, 17, 20, 23, or 91, the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207220, the amino 
acid sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® 
as Accession Number 207223, the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number PTA-248, or the 
30 amino acid sequence encoded by the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number PTA-293; 
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b) a polypeptide comprising a fragment of the amino acid sequence of 
SEQ ID N0s:2, 5, 8, 1 1, 14, 17, 20, 23, or 91, the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 
207220, the amino acid sequence encoded by the cDNA insert of the plasmid 

5 deposited with the ATCC® as Accession Number 207223, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number PTA-248, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number PTA-293, wherein the 
fragment comprises at least 15 contiguous amino acids of SEQ ID N0s:2, 5, 8, 11, 14, 

10 17, 20, 23, or 91, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207220, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207223, the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number PTA-248, or the amino acid 

15 sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number PTA-293; and 

c) a naturally occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID N0s:2, 5, 8, 11, 14, 17, 20, 23, or 91, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 

20 Accession Number 207220, the amino acid sequence encoded by the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207223, the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number PTA-248, or the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number PTA-293, 

25 wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID NOs:l, 3, 4, 6, 7, 9, 10, 12, 13, 15, 16, 18, 
19, 22, 24, 25, or a complement thereof under stringent conditions; 

comprising culturing the host cell of Claim 5 imder conditions in which the 
nucleic acid molecule is expressed. 

30 
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13. A method for detecting the presence of a polypeptide of Claim 8 in a 
sample, comprising: 

a) contacting the sample with a compound which selectively binds to a 
polypeptide of Claim 8; and 
5 b) determining whether the compound binds to the polypeptide in the 

sample. 

14. The method of Claim 13, wherein the compound which binds to the 
polypeptide is an antibody. 

10 

15. A kit comprising a compound which selectively binds to a polypeptide 
of Claim 8 and instructions for use. 

16. A method for detecting the presence of a nucleic acid molecule of 
15 Claim 1 in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a 
nucleic acid molecule in the sample. 

20 

1 7. The method of Claim 16, wherein the sample comprises mRNA 
molecules and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic 
25 acid molecule of Claim 1 and instructions for use. 

19. A method for identifying a compound which binds to a polypeptide of 
Claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a polypeptide of Claim 8 
30 with a test compound; and 

b) determining whether the polypeptide binds to the test compound. 
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20. The method of Claim 19, wherein the binding of the test compound to 
the polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detecting of test compound/polypeptide 
binding; 

5 b) detection of binding using a competition binding assay; 

c) detection of binding using an assay for TANGO 244-, TANGO 246-, 
TANGO 275-, TANGO 300-, or MANGO 245-mediated signal transduction. 

21 . A method for modulating the activity of a polypeptide of Claim 8 
10 comprising contacting a polypeptide or a cell expressing a polypeptide of Claim 8 

with a compound which binds to the polypeptide in a sufficient concentration to 
modulate the activity of the polypeptide. 

22. A method for identifying a compound which modulates the activity of 
1 5 a polypeptide of Claim 8, comprising: 

a) contacting a polypeptide of Claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 

20 
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GTCGACCCACGCGTCCCGCCCCACACCCAGACCCACCTToACCGCCCACwTGGCAGGAGCAGGACAGGACGGCCGGACG 7^1 

MAE L ? G ? F C O A L G r L C 

wGGCC ATG GCC GAG CTC CCG GGG CCC TTT CTC TGC GCG GCC CTG CTA GGC TTC CTG TGC 

;^ J 7 3 o 3 M ? C S Q i G Q T S V G G 

rXG AGT GTT CCC CCC AGT P^T CCC 7TA TGC AGT CAG AGT GGA CAA ACC TCT GTG GGA GGC '"/V 

S - A L R C 5 3 S c G A ? K ? 7 Y ^ 
TCT ACT GCA CTG AGA TGC AGC TCT TCC GAG GGG GCT CCT AAG CCA GTG TAG AAC TGG GTG ZS^ 

R L G r F ?t?S?G5M VQDEVSg 
CGT CTT GGA ACT TTT CCT ACA CCT TCT CCT GGC AGC ATG GTT CAA GAT GAG GTG TCT GGC ^'^ 

qliltmi-sltssgty rcvat 
cag ctc att ctc acc aac ctc tcc ctg acc tcc tcg ggc acc tac cgc tgt gtg gcc acci'?^^ 

NQMGS ASCELTuSVTEPSQG 
^JKC CAG ATG GGC AGT GCA TCC TGT GAG CTG ACC CTC TCT GTG ACC GAA CCC TCC CAA GGC ^33 

^ • r ^ Q \ ^ I G 1- ' J " ^ ^ 5 7 A A 

CGA GTG GCC GGA GCT CTG ATT GGG GTG CTC CTG GGC GTG CTG TTG CTG TCA GTT GCT GCG^^'l'i 

f C L V R r^KERGKKPKETYGG 
TTC TGC CTG GTC AGG TTC CAG AAA GAG AGG GGG AAG AAG CCC AAG GAG ACA TAT GGG GGT555 

S D L R ' 
AGT GAC CTT CGG TGA S73 

GCAGGAGGGCTGGGGGGTGGCGCAAGGAGGGAGGAAAGGGCTTG AGTTAAAAGCGGGTGCCTGCAACCCTCAAACTC ^ '->^ 
ACATCATTCAGTGTGTTTAGGGGCAGGAGGTGTTGTTCAGCCGTGGAATTTGCTGGTGGCAGC^^ "7^' 
TTGAGGGTACAGGCAAGCGGTACAGGGTGGAGTGGCTGGTCCACAAGCTGTGGCAGGGAAGCTGTTTGCAGGACTGCCC V'^"* 
TGCCCCTCCTCATATTTAATAAAGTTTACTTTTCTGTTCCGAAGGTATTTTCATATATTTTAACCACCTGGGAGTA^^^ 
3TGGCTTGTAGATGCCAGGAAATGGATTTGTCCTGAGCAGTCAGCTGAGTTCAATTCTTCTGTGGAGGAAATCAGGAAA 
3GGGAGGGGAAACTGCCTCTGTCATCCACTTTAGCTGCCAGNCAGGGTCTAGGATAGGGATCAGAGCAACATTTCTTCA 

1 

GGTGGAGTCCTCACSATrACCTGGACAGAAATCACCGGGAACTAGTTATACATTCAGATTCAGGCCACTTCT^ 

TXrrAGTTGTGCGTTCGGGAGTGATNAGGCCCANAAATTTCOTTTTAACCA^ ' ^ 

G AAArrrAAGAGTCCCCAGGirAGAGGAaMCCCTCCNCCGCAGGAGGOTTTTACTGKTTACTCAG^^ ' ^ ^ 
CCATCAGGGAGGATGCCATCGCTCCTGGGATCTCTGAGCACACTTGTATGAGGGCTGAriCT 

AAGACCCTCGTCTGCCAGCACCGTGACGACCACCAAGTCCAAGCTCCCTATGGTCGTGTGACTTCTC ' '''' ^ 

GGCGGTGAGGGGGAATATCAATAATTAAAGTCTGTGGGTACCAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 1513 
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ALIGN caiculaces a giobai diigrrmenc oc cv/o sequencl^ 

•/erfiion 2,0uPIeas€ cice: Myers and Miller. CABIOS (19891 
> hT244 a. a. aa vs 

GeaPepc AF06I022 - Homo sapiens CTH gene, compi 325 aa 
scoring matrix: paml20.mac. ^ap penalties: -L2/-4 
48.6% Ldencicy; Global alignmenc score: 149 

iO 20 

mpucs KAELPGPFLCGALLGFLCLS 



MADLPGPFLCGALLGFLCLXLAVE^/KVPTEPLSTPLGKTAELTCTYSTSVGDTFALEWSF 
10 20 30 40 SO 60 



inputs 



inputs 



VQPGKPISESHPILYFTNGHLYPTGSKSKRVSLLQNPPTVGVATLKLTDVHPSDTGTYLC 

70 SO 90 100 110 120 

30 40 SO 
VPPSNPLCSQSGQTSVGGSTALKCSSSEGAPKPVYNWVR 



QVNNPPDFYTNGLGLINLTVLVPPSNPLCSQSGQTSVGGSTAUICSSSEGAPKPVYWVR 
130 140 150 160 170 180 

60 70 80 90 100 110 

inputs LGTFPTPSPGSMVQDEVSGQLII,TITI.SLTSSGTYRCVATNQMGSASCELTLSVTEPSQGR 

LGTFPTPSPGSMVQDEVSGQLILTNLSLTSSGTYRCVATNQLGSASCELTLSVTEPSQGR 
190 200 210 220 230 240 

120 130 140 150 160 
inputs VAGALIGVLLGVLLLSVAAFCL'-'RFQKERGKKPKETYGGSDLR 

VTGAI.IGVT.LGVLLLSVAAFCL'>mFQKERGKKPKETYGGSDIJl£DAIAPGis^^ 

250 260 270 280 290 300 



inpucs. - • 



SSKGFLERP S ASTVTTTKSKLPMW 
310 320 
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rnpuc riie Athsa3 4d2 . seq; Oucduc file A chsa J ^d2 . pac 
Sequence Length ^002 

GTCCACCCACGCGTCCGCAACATCCTCGCTTAGTATTCTGTGCAAAATCAGAGAGGGGTGCAAGATCCTGATTTITCAG 
GAGrrCXAGCGACA a!g GCA g3c -Ia TAG GGC AGT a?g aSc TTC A^C CCC AGC aJa OCX GM Hi 
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TTG 


CTG 


G 
GGC 


CTG 


I 

ATC 


Q 
CAG 


R 
CGC 


V 
GTG 


'/ 


R 
AGG 


GAG 


K 
AAG 


GAA 


G 
GGC 


C 
TGC 


V- 
TAC 


T 
ACT 




AGG 


A 
ATG 


V 

TAC 


Q 
C.\A 


R 
AGG 


A 
GCG 


GAG 


GAG 


GAG 


ATC 


Q 
CAG 


AAG 


T 
ACA 


Q 
CAA 


A 
GCA 


H 
ATG 


Q 
CAA 


GAA 


CTC 


H 
CAC 


R 
AGA 


V 
GTG 


E 
GAG 


CTG 


E 

GAG 


R 
AGA 


GAG 


AAA 


A 
GCG 


R 
CGG 


ATA 


E 
GAG 


E 
GAG 


Y 
TAT 


E 
GAA 


E 
GAG 


K 
AAA 


r 

ATC 


R 
AGA 


K 
AAG 


L 
CTG 


E 
GAA 


D 
GAT 


K 
AAA 


V 
GTG 


GAG 


Q 
CAG 


£ 

GAA 


K 
AAG 


R 
AGA 


K 
AAG 


Q 
CAA 


M - 
ATG 


GAG 


K 
AAG 


K 
AAA 


L. 
CTA 


A 
GCA 


E 
GAA 


Q 
CAG 


E 
GAG 


A 
GCT 


H 
CAC 


V 

TAT 


A 
GCT 


V 
GTA 


R 
AGG 


Q 
CAG 


Q 
CAA 


A 
GCA 


R 
AGA 


T 
ACG 


E. 
GAA 


V 
GTG 


E 
GAG 


S 
AGT 


K 
AAG 


D 
GAT 


G 
GGG 


I 

ATA 


L 
CTT 


E 
GAA 


L 

TTA 


I 

ATC 


M 
ATG 


T 
ACA 


A 
GCG 


L 
TTA 


r 

ATT 


A 
GCT 


S 
TCC 


F 
TTT 


r 

ATT 


L 
TTG 


L 
TTA 


R 
CGT 


L 

CTG 


F 
TTC 


A 
GCG 


E 
GAA 


D 
GAT 


* 

TAA 


330 
1083 









w 



K 
AAG 

K 

Q 
ZM 

A 

sen 

N 
\AC 

Q 
ZAP 

R 
\GJ 

K 
^AG 

R 

hGC 

Q 



ACTTAATGAAAATCTGTTTGTATTTTCTGCATATTCTCTGGCAACCTTGCCCCATACTTACTTAr^ 

TGCTCTAGTTTCTGTCTCTCAGGCACTCGTAACTAAGGACCACCATTGGCCATTGGTAGATGTTTGATTGAC'rrAACAA /IW/ 
GAGAGGGAC AAATTTTCAATTTGTGAAACTCCAAAGCAG AAAGTATTGGTGCTTGCTACCTTGTGAATTCTTCCTTAGA / S 
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: ATGCAG AGAAAATCTATGCAaJRcC.^AAAAGATGGCTCC.^AGCT ATGTCATGTT^?CTAATAAAATCrrTTCT^ U 9 9 
CTAGATTCrnCTATCriTGGCAGATAATCTCCCCTTGTAGCTTCC ACTCACTTATT^ 1478 
CATCTTACCCATGTGGrnTTGAGAAAGAAAGATCAATTCTTTGTrTGCAGTAGGTAATCrrAGAGATGGAGATOAT^ 1557 
TAGAATTArrCCTAGATGAGTGTCAArrTATTT.^ArrCCATTGTCATATAAGGAGTCAAATTGTTTCT^ 1636 
CATTCAAGAACAGAGACCTGTCTGGAAAATCGATCTCTACAAATTCAATTAAATAATGATCCCCAAATGCTGAAAAAGT 17 IS 
GAAATACAGCAATTCAACAGATAATAGAGCAATGTrrAGTATATTCAGCTGTATCTGTAGAAACTCTTTGA 1794 
AATTTAACCAATTTGATGAATACCCAGT-rCTCTTCTTTTCTAGAGAAAGATAGTTCCAACCTC^^ 1873 
ACTTTGAATACCTATTGTTTGGCAGGTCATCCACACACTTCTCCCCCCACTCCATTCAATT^^ 1952 
TAATAAAACTTTTCAATTATCTCAAAAAAAAAAAAAAAAA mSl 
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I i : ' i i • : ' : I • I ' M ■ 1 ' I ' ' [ 
^ '^l 81 121 ISI 201 241 281 321 



im 
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CohTSe^J^OS *->kGfdFTLMWGeSGLGKtTlINTLFlcdLidangvanDsreidgase 

^ ^-VG G GK++ ♦ 4- + + ^ + 

TA^foGO :~ RNSQLRIVLVGKTGAGKSATGWS:'-.GSKVFHSGL- aAK 63 

ckiklccvelkexckvej-EEdCvkLnLTv LDTPG . TGDa iCNskcWepIve 
r,4-+ * * L V + DTPG F - + + ^ i^.^. 

-.4 -SI--TKKCE-KRSSSWKET ELVWDTPG i FDTEVPNAETSKEI IR 105 

YldeQheqYLrqEsrinRtkivDnRVHcCLYFIsPtGhgLkpLDvefMKk 
1,06 CILLTSPGPHALL-L-VVPLGR L25 

LsekVNlIPV-IAKADcLTadElqef Kkrlreei . . . . erqnIklYkFPde 
1^26 YTEEEHKAT-EKILKMFgeraRSFMrLIFT--RK 156 

eedeCDEefkeqcqqLkssiPFAIVGSneeiengdGekVRGRkYPWGvVE 

L57 DDLG-DTN LHDYtRE APEDIQD LMD 180 

\AENpsHCDFvkLRnlLircHLqDLkecTeeiiYEnYRseKLsalglkaen 
+ t-L n k c +e't- E R^-+ L ^+ •*"*■ 

181 IFGDRYC ALNN KATGAEQ- - EAQRAQLLGLIQRVTOE 215 
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CavJSGV)Su5 ' ->GevialvGpNGaGKSTLLklxsGilp pceGtilldCardlr 

^- + +IvG t-GaGKS ^ iKS ^-t-^-h ^ t + ^- r+ ^ 
TAVJGOLMO 30 QLRIVLVGKTGAGKSATGNSILGRKVf hsgcaAKSITKKCEK-RSSS 75 

. IskDcerierirknigwtQdpcifpnveicvreaiaf girls 

+ e+ -t- + + ^--t-c pn c +e'fi+ 1 

76 wKE TELW--\nyrPGIFDTEVPN-AETSKEIIRCILLTSpgphal 117 

IglskdeqrarllOcagaeelLerlglgydhlldrrpgcLSGGqk 

lie llwplGRYTEEEHKAT EKILKMFGER ARS 147 

QRvaiARaLltkpklLlLDEPTagLDpasraqllellrelrqqggTvlIi 
+ +-t-c+ D^- * ^-1 + + lre 
148 FMILIFTRK DDLGDTNLHDYLREAPE 173 

cHdldlldrla.DrilvledG<- ' 
d+ + + l + + +' +Dr ^ It--*- 
174 --OIQDLMDIFgDRYCALNNK 192 
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M R G 3 

GTCGACCCACGCGTCCGCCCGOATGCCCGGGCCCCGAGGGGCTGCTGGCG^ ATG CGC GGG 73 

AGAAGLLALLLLLLLLLLGL 23 

GCG GGG GCG GCG GGG CTG CTG GCG CTG CTG CTG CTG CTG CTG CTG CTG CTG CTG GGC CTG 133 

GGRVEGGPAGERGAOGGGAL 43 

GGC GGC AGG GTC GAG GGG GGG GCG GCC GGC GAG CGG GGC GCA GOC GGG GGC GGG GCG CTG 193 

ARERFKVVFAPVICKRTCLK 63 

GCC CGC GAG CGC TTC AAG GTG GTC TTT GCG CCG GTG ATC TGC AAG CGG ACC TGT CTC AAG 2 53 

GQCRD SCQQGSNMTLIGEN G 83 

GGC CAG TGT CGG GAC AGT TGT CAG CAG GGC TCC AAC ATG ACG CTC ATC GGA GAG AAC GGC 313 

HSTDTLTGSGFRVVVCPLPC 103 

CAC AGC ACA GAC ACG CTC ACG GGC TCC GGC TTC CGC GTG GTG GTG TGC CCT CTC CCC TGC 3 73 

MNGGQCSSRNQCLCPPDFTG 123 

ATG AAT GGC GGC CAG TGC TCC TCG CGA AAC CAG TGC CTG TGT CCC CCG GAC TTC ACT GGG 4 33 

RFCQVPAGGAGGGTGGSGPG 143 

CGC TTC TGC CAG GTG CCC GCA GGA GGA GCC GGT GGG GGT ACC GGC GGC TCA GGC CCC GGC 4 93 

LSRTGALSTGAL PPLAPEGD 163 

CTG AGC AGG ACA GGG GCC CTG TCC ACA GGG GCG CTG CCG CCC CTG GCT CCG GAG GGC GAC 5 53 

SVASKHAIYAVQVIADPPGP 183 

TCT GTG GCC AGC AAG CAC GCC ATC TAC GCC GTC CAG GTG ATC GCT GAC CCT CCT GGG CCC 613 

GEGPPAQHAAFLVPLGPGQI 203 

GGG GAG GGG CCT CCT GCC CAG CAC GCA GCC TTC CTG GTG CCC CTA GGC CCG GGA CAG ATC 673 

5;^EVQAPPPVVNVRVHHPPE 223 

TCA GCA GAA GTG CAG GCC CCG CCC CCC GTG GTG AAT GTG CGC GTC CAT CAC CCG CCC GAG 7 33 

;^SVQVHRIESSNAESAAPSQ 243 

GCC TCA GTC CAG GTG CAC CGC ATT GAG AGC TCG AAC GCC GAG AGC GCA^GCC CCC TCC CAG 7 93 

HLLPHPKPSHPRPPTQKPLG 263 

CAC CTG CTG CCG CAC CCC AAG CCC TCG CAC CCT CGG CCG CCC ACC CAG AAG CCC CTG GGC 853 

ReFQDTLPKQPCGSNPLPGL 283 

CGC TGC TTT CAG GAC ACT CTG CCC AAG CAG CCG TGT GGC AGC AAC CCC CTC CCC GGC CTC 913 

TiCaEDCCGSIGTAWGQSKCH 303 

ACC AAG CAQ QAA GAC TGC TGC GGT AGC ATC GGC ACT GCC TGG GGC CAG AGC AAG TGC CAC 973 

KCPQLQYTGVQKPGPVRGEV 323 

AAG TGT CCC CAG CTG CAG TAC ACA GGA GTG CAG AAG CCA GGG CCT GTA CGT GGG GAA GTG 1033 

GADCPQGYKRLNSTHCQDIN 343 

GGC GCT GAC TGT CCC CAG GGC TAC AAG AGG CTT AAC AGC ACC CAC TGC CAG GAC ATC AAC 1093 



ECAMPGVCRHGDCLNNPG 



363 



GAG 



TGC GCA ATG CCG GGC GTG TGT CGC CAT GGT GAC TGC CTC AAC AAC CCT GGC TCC TAT 1153 
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R C V C P P G 

CGC TGT GTC TOC CCA OCT GGC 

K ? E E K S L 

AAA CCG GAG GAG AAG AGC CTG 

P L T T R L T 

CCA CTG ACC ACC CGC CTG ACC 

A R C Q R C P 

GCG CGG TGT CAG CGC TGC CCA 

G K G Y H r L 

GGG AAG GOA TAC CAC ATT CTC 

F S L F L H P 

TTT TCC CTT TTC CTG CAC CCT 

S Q A P P P E 

AGC CAG GCT CCA CCA CCT GAG 

V S E E R S V 

GTG AGT GAG GAG AGG TCA GTG 

P Y P E L r S 

CCC TAC CCC GAG CTG ATC TCC 

L P P S R S A 

TTG CCT CCT TCC CGC AGC GCC 

C R L N Q N I 

TGC CGA CTG AAC CAG AAC ATC 

S C H C N P G 

TCC TGC CAC TGC AAC CCC GGC 

M C E A E P 

AAC GAG TGC GAG GCA GAG CCC 

S Y N C H C N 

TCC TAC AAT TGC CAC TGC AAC 

C V D L N E C 

TGC GTG GAC CTG AAC GAA TGC 

N F G H Y K 

AAC TTT CCC GGT CAC TAC AAG 

p p V C E D I 

CCT CCT GTQ TGC GAA GAC ATC 

C E N K P G S 

TGC GAG AAC AAG CCC GGG AGC 

G G O A C R D 

GGG GGC GGG GCC TGT CGC GAC 

W C E N L P G 

TGG TGC GAG AAC CTC CCG GGC 



H S L G P S R 
CAT AGT TTA GGC CCC TCC CGT 

C F R L V S P 
TGT TTC CGC CTG GTG AGC CCT 

R Q L C C C S 
CGC CAG CTC TGC TGC TGC AGT 

T D G T A A F 
ACA GAT GGC ACC GCT GCG TTC 

T S H Q T L T 

ACC TCC CAC CAG ACG CTC ACC 

D G P P K P Q 
GAC GGG CCA CCC AAG CCC CAG 

D T E E E R G 
GAC ACA GAG GAA GAG AGA GGG 

Q 0 S H P T A 
CAG CAG AGC CAC CCA ACT GCC 

R P S P P T M 
CGT CCC TCG CCC CCG ACC ATG 

V E I V A P" T Q 
GTA GAG ATC GCT CCC ACT CAG 

C G . H G E C V 
TGT GGC CAC GGA GAG TGC GTG 

V R S H P Q H 
TAC CGG TCA CAT CCC CAG CAC 

C G P G R G I 
TGT GGC CCG GGG AGG GGC ATC 

R G Y R L H V 
CGC GGC TAC CGC CTG CAC GTG 

A K P H L C G 
GCC AAG CCC CAC CTG TGC GGC 

C N C Y P G Y 
TGC AAC TGC TAC CCC GGC TAC 

D E C R D P S 
GAC GAG TGC CGG GAC CCA AGC 

F K C r A C Q 
TTC AAG TGC ATC GCC TGT CAG 

V N E C A E G 
GTG AAC GAG TGC GCC GAG GGC 

S F R C T C A 
TCC TTC CGC TGC ACC TGT GCC 
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T Q C I A D 383 

ACA CAG TGC ATT GCA GAC 1213 

E H Q C Q H 403 

GAG CAC CAG TGC CAG CAC 12 73 

V G K A W G 423 
GTC GGC AAG GCC TGG GGC 1333 

K E I C P A 443 

AAG GAG ATC TGC CCA GCT 13 93 

I 0 G E S D 463 

ATT CAG GGC GAG AGT GAC 14 53 

Q L P E S p 483 

CAG CTT CCG GAG AGC CCT 1513 

V T T D S P 503 
GTG ACC ACG GAC TCA CCG 1573 

T T T P A R 523 

ACC ACG ACT CCT GCC CGG 1633 

R W F L P D 543 

CGC TGG TTC CTG CCG GAC 16 93 

V T E T D E 563 
GTC ACA GAG ACT GAT GAG 1753 

P G P P D Y 583 

CCG GGC CCC CCT GAC TAC X813 

R Y C V D V 603 

CGC TAC TGC GTG GAT GTG 18 73 

C M N T G G 623 

TGC ATG AAC ACC GGC GGC 1933 

G A V G G R S 643 

GGC GCC GGG GGG CGC TCG 1993 

D G G F C I 663 

GAC GGC GGC TTC TGC ATC 2053 

R L K A S R 683 

CGG CTC AAA GCC TCC CGG 2113 

S C P D G K 703 

TCT TGC CCG GAT GGC AAA 2173 

P G Y R S Q 723 

CCT GGC TAC CGC AGC CAG 2233 

S P C S P G 743 

AGC CCC TGC TCG CCT GGC 2 2 93 

Q G Y A P A 763 

CAG GGC TAC GCG CCC GCG 2 3 53 
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p D G R LDVDECiA D V C D N 783 

CCC GAC GGC CGC AOT TGC TTQ GAT GTG GAC GAG TGT GAG GCT GGQ GAC GTO TGT GAC AAT 2413 

GXCSNTPGSFQCQCLSGYHL 803 

GGC ATC TGC AGC AAC ACG CCA GGA TCT TTC CAQ TGT CAG TGC CTC TCT GGC TAC CAT CTG 2473 

SRDRSHCEDIDECDFPAACI 823 

TCC AGG GAC CGG AGC CAC TGC GAG GAC ATT GAT GAG TGT GAC TTC CCT GCA GCC TGC ATT 2 S3 3 

GGDCINTNGSYRCLCPQGHR 843 

GGG GGT GAC TGC ATC AAT ACC AAT GGC TCC TAC AGA TGT CTT TGC CCC CAG GGG CAT CGG 2593 

LVGGRKCQDIDECSQDPSLC 863 

CTG GTG GGT GGC AGG AAA TGC CAA GAC ATA GAT GAG TGC AGC CAG GAC CCG AGC CTG TGC 2653 

LPHGACKNLQGSYVCVCDEG 803 

CTT CCC CAT GGG GCC TGC AAG AAC CTT CAG GGC TCC TAT GTG TGT GTC TGC GAT GAG GGC 2713 

FTPTQDQHGCEEVEOPHHKK 903 

TTC ACT CCC ACC CAG GAC CAG CAC GGT TGT GAG GAG GTG GAG CAG CCC CAC CAC AAG AAQ 2773 

ECYLKFDDTVFCDSVLATNV 923 

GAG TGC TAC CTG AAC TTC GAT GAC ACA GTG TTC TGC GAC AGC GTA TTG GCC ACC AAC GTG 2 833 

XOQECCCSLGAGWGDHCEIY 943 

ACC CAG CAG GAG TGC TGC TGC TCT CTG GGG GCC GGC TGG GGC GAC CAC TGC GAA ATC TAC 2893 

PCPVYSS AEF HSLCPDGKGY 963 

CCC TGC CCA GTC TAC AGC TCA GCC GAG TTC CAC AGC CTC TGC CCA GAC GGA AAG GGC TAC 2953 

TQDNNIVNYGIPAHRDIDEC 983 

ACC CAG GAC AAC AAC ATC GTC AAC TAC GGC ATC CCA GCC CAC CGT GAC ATC GAC GAG TGC 3 013 

MLFGSEICKEGKCVNTQPGY 1003 

ATG TTG TTC GGG TCG GAG ATT TGC AAG GAG GGC AAG TGC GTG AAC ACG CAG CCT GGC TAC 3 073 

ECYCKQGFYYDGNLLECVDV 1023 

GAG TGC TAC TGC AAG CAG GGC TTC TAC TAC GAC GGG AAC CTG CTG GAA TGC GTG GAC GTG 313 3 

DECLDESNCRNGVCENvTRGG 1043 

GAC GAG TGC CTG GAC GAG TCC AAC TGC CGG AAC GGA GTG TGT GAG AAC ACG CGC GGC GGC 3193 

YRCACTPPAEYSPAQRQCLS 1063 

TAC CGC TGT GCC TGC ACQ CCC CCT GCC GAG TAC AGT CCC GCG CAG CGC CAQ TGC CTG AGC 3 253 

P ES5---:K DVDECQDPAACRPGRC 1083 

CCG GAA OAa ATO GAC GTG GAC GAG TGC CAG GAC CCG GCA GCC TGC CGC CCT GGC CGC TGC 3 313 

VNLPGSYRCECRPPWVPGPS 1103 

GTC AAC CTG CCG GGC TCC TAC CGC TGC GAG TGT CGC CCG CCC TGG GTG CCC GGG CCC TCC 3373 

GRDCQLPESPAERAPERRDV 1123 

GGC CGC GAT TGC CAG CTC CCC GAG AGC CCG GCC GAG CGT GCC CCG GAG CGG CGC GAC GTG 3433 

CWSQRGEDGMCAGPLAGPAL 1143 

TGC TGG AGC CAG CGC GGA GAG GAC GGC ATG TGC GCT GGC CCC CTG GCC GGG CCT GCC CTC 3493 

-yPDDCCCRQGRGWGAQCRPC 1163 

ACC TTC GAC GAC TGC TGC TGC CGC CAG GGC CGC GGC TGG GGC GCC CAA TGC CGA CCG TGC 3 553 
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PPRGAGSHCPTSQSESNSFW iie3 

CCG CCG CGC GGC GCG GGG TCC CAT TGC CCG ACA TCG CAG AGC GAG AGC AAT TCC TTC TGG 3 613 

OTSPLLLGKPPRDEDSSEED 1203 

GAC ACA AGC CCC CTG CTG TTG GGG AAG CCC CCA AGA GAT GAG GAC AGT TCA GAG GAG GAT 3 673 

SDECRCVSGRCVPRPGGAAC 1223 

TCA GAC GAG TGT CGC TGC GTG AGT GGC CGC TGC GTG CCG CGG CCG GGC GGC GCC GCG TGC 3 73 3 

ECPGG FQLDASRARCVDIDE 1243 

GAG TGT CCC GGC GGC TTC CAG CTC GAC GCC TCC CGC GCC CGC TGC GTG GAT ATC GAC GAG 3 7 93 

CRELNQRGLLCKSERCVNTS 1263 

TGC CGA GAG CTG AAC CAG CGC GGG CTG CTG TGC AAG AGC GAG CGC TGC GTG AAC ACC AGC 3 853 

GSFRCVCKAGFARSRPHGAC 1283 

GGC TCC TTC CGC TGC GTC TGC AAA GCC GGC TTC GCG CGC AGC CGC CCG CAC GGG GCC TGC 3 913 

V P Q R R R * 1290 

GTT CCC CAG CGC CGC CGC TGA 3 93 4 

CGCCGCCGACGCCGCCCTCGGCCCAGACCTCGGTGATCACTGAGGGATTTCCGCGAGCTCGGCCTCAC^ 4 013 

CTrGTGGCTCGGACCCAGGGACCrrCAGGGCCCGCAGACCCTCCCGGaSCCTTGAGACCCGAG^ 4 092 

CCCTCCCCGGTTAGCGGGCGGTTGTAAGGTCTCCGGCGGGCGCTGCCTGCCTTCCTCCCA£WX^^ 4171 

TGATAAATCAGATCGTGCCTCTTT AAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 4225 
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hT275 



:tT27 5 



hT275 



hT27 5 



hT27 5 



hT275 



hT27 5 



hT275 



hT275 



hT275 



->CnpncgpCir.gGtCvncpggsvcggycCeCpeGyaisyt:G)crC< - • 
C++ pC+ngG+C + ^ Cp+ +ta++C 
CPL PCMNGGQCSSRN OCLCPPD cTGKFC 

->CnpncgpClngGt:CvntpggsvfggycCeCpeGyaisycGkrC<-- 
C^.+** ^ G C+n pg ^yC+Cp+G+ + l+ ^ +C 

CAMPG-VCRH-GDCLNNPG SYKCVCPPGHSU3PSRTQC 

->CnpncgpClngGtCvntpggsvf ggycCeCpGGyalsycGkrC< - ' 
CRLNQNICGH-GECVPGPP DYSCHCNPGYRSHPQHRYC 

->CnpncgpCina .GcCvncpggsvfggycCeCpeGyals . .ycGkrC< 
r . pC ai^G^Cvnc^-g C^C ^ G^ C 

CE:AE--?CGPGrGXCMNTGG SYNCHCNKGYRLHvgAGGRSC 

• ->CnpncgpCingGtCvncpggsvf ggycCeCpeGyais . ycGkrC< - 

C + ++-^ C gG+Cfa pg t-y C C +Gy+i+ ^ C 

CAKPH-LCGDGGrCINFPG HYKCMCYPGYRLKaSRPPVC 

• ->CnpncgpClngGtCvnt:pggsv£ggycCe .CpeGyalsycGkrC< - 

C ^--^ C G+C n pg ^--^ C+ C + +Gy+ G C 

CRDPS-SCPD-GKCENKPG SFKCIaCQPGYRS-QGGGAC 

* ->CnpncgpClngGtCvntpggsvf ggycCeCpeGyaisycGkrC< - • 

C+ + pC+ G C n+pg t-+-K:+C + +Gya+ -G+ C 

caegs-?csp-g;;cenlpg sfrctcaqgyapapdgrsc 

' ->CnpncgpCingG::C\-ncpggsvrggYcCeCpeGyaisyt:GkrC< - ' 
C +C + n G-C ntpg ++-k:+C Qy Is" + -^C 

CEAGD-VCDN-GICSNTPG SFQCQCLSGYHLSRDRSHC 

• ->CnpncgpClngGCCvnt:pggsvf ggyt:CeCpeGyalsytGkxC< - * 

C++* C + G C+nt g -t-y+C Cp+G+ + 1 G++C 

CDFPA-ACIG-GDCINTNG SYRCLCPQGHRL-VGGRKC 

* ->CnpncgpClngGtCvnt:pggsvf ggycCeCpeGyaisytGkrC<- ' 

C++^--^ Cl +G C n++g +y C+C+eG+ ■#- + C 

CSQDPSLCLPHGACKNLQG SYVCVCDEGFTPTQDQHGC 

* - >Cnpncg . pClngGtCvncpggsvf ggycCeCpeGyalsycGkrC< - 

c + ^ c+ G+Cvnc+ gy+c c + +g++'^ + c 

CMLFGSelCKE-GKCVNTQP GYECYCKQGFYYDGNLLEC 
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' - >CnpncqpClngGcCvncpggsvf ggycCeCpeGyals . y tGkrC< - • 
C *• ♦ t-C n G C nc g gy^G C+- + -^ a + + ^ +-Kr 

hT27 5 CLDES-NCRN-GVCmTRG GYRCACTPP-AEYs PAQRC 



' - >CnpncgpClngGtC-/ncpggsveggytCeCpeGyaisycGkrC< - ' 
0+- + *-^ G G-KTvn + pg +y+CeC ++ +G ++C 

hT27 5 CQDPA-ACRP-GRCVNLPG SYRCECRPPWVPGPSGRDC 



• ->GnpntgpCingGtCvnt:pggsvfggycCeCpeGyaisycGJcrC<- 
+ ^ C Gi-Cv+ pg g GeCp G>^1 + rC 

^'^215 - DSDECRCVS -GRCVPRPG GAACECPGGFQLDASRARC 



-->Gnpntg. . . pGlngG tCvntpggsvtggytCGCpeGyaisytGkrC< 
G -^Cvnc g +'^'^C+C-t- G+a s C 

CRELNQ rg i LC KS E - ^CVNTSG S FRGVCKAG F ARSRPHG AC 
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hT27 5 



hT27 5 



->grC . snpipgravTKse . CCCsvGrgeAWGtp . CElCPvpgcaetke 
+ C + snplpg ^TK+e-KTC s G^- AWG +CP - + -^^ 

QPCgSNPLPG--LTKQEdCCGSIGT- 



-AWGQSkCHKCPQLQYTGVQK 3 29 



P 330 



hT27 5 



->gi:CsnplpgravTKseCCCsvGrgGAWGtpCElCPvpgtaefkeL<;- 
^-C+t-pI r CCCsvG+ AWG C^- CP + ^-gca-*- f ke+ 

HOCQHPLTTR-LTRQLCCCSVGK--AWGARCQRCPTDGTAAFKEI 



hT27 5 



->grCsaplpgravTKseCCCsvGrgeAWGtpC£. . iCPvpgcaefkeL 
C+ li- + vT ^eCCCs+G-f -»-WG+-K:E ^CPv ^-aef^- L 
VFCDSVLATN-VTQQECCCSLGA--GWGDHCEiyPCFVYSSAEFHSL 



hT27 5 



hT27 5 



->grCsnpipgravTKseCCCsvGrgeAWGtpCElCPvpg. . . taefke 

g^c ^pl+g a+T * CCC Gr -WG +CP^+g 
GMCAGPLAGPALTFDDCCCRQGR--GWGAQCRPCPPRGagsHCPTSQ 



hT27 5 
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' ->ruDPqnCsCacggsCtCgcsCkCknC.kCtsCkKsccsCcPagCskCa 
DP sC+ g -C + + ^ +C C^-*- ^ > *Ca 

RDP- - SSCPDG- - - KCENKPGSFKCiACQPGYRSOGGGACRDVNECA 

qgC'/Ckgg. . gaasecskCsCCa< - • 

C g +f C C + 

EGSPCSPGwcENLPGSFRCTCAQ 
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ALIGN calculates ^^P^obal alignment of two sequences 
verBion 2.0uPleaBe cite: Myers and Miller, CABIOS (1989) 

> hT275 n.a. 4225 aa vs. 

> L40459 n.a. 4317 aa 
scoring matrix: paml20.mat. gap penalties: -12/ -4 

77.1% identity; Global alignment score: 16879 

10 20 

inputs GTC GACCCA CG CGTCC GCCCG - - G ATGCC - 



CCTCCTGCTGTCCCCTCCCTACCCTTGGCTTCTCGCCCCGCTCTGCCCTCTGCTACCAACACTCG^^ 

10 20 30 40 50 60 70 



30 40 50 60 70 

inputs CGGGC-C CCGAGGGGC-TGCTGGC- -GG CCTGGCCCC-TGAGATGCGCG 



CTGCTCGGGCTCGACCTCCAATCTCCGAGGGTCGTGCGGCCCCGGATGCCCGGGCCCCGAGCGGTGCCCA 
80 90 100 110 120 130 140 



80 90 100 110 120 130 

inputs GGGCGGGGGC- -GGCGGGGCTGCTGGCGCTGCTGCT GCTGCTGCrn3CTGCTGCTGCTGG-GC- -CT 



CGGCCTGGCCCCTGCGATGCGCCAGGC - CGGCGGATTGGGGCTGCTGGCACTACTCCTGCTGGCGCTGCT 
150 160 170 160 190 200 

140 150 160. 170 180 190 200 

inputs GGGCGGCAG-GGTCGAGGGG-GGGCCGGCCGGCGAGCGGGGCGCAGGCGGGGGCGGGGCGCTGGCCCGCG 



GGGCCCCGGCGGCCGAGGGGTGGGCCGGCCGGGCAGCGGGGCACAGGCGGGGGCGGGGCGCTGGGCC - C - 
210 220 230 240 250 260 270 

210 220 230 240 250 260 270 

i npu t s AGCGCTTCAAGGTGGTCTTTGCGCCGGTGATCTGCAAGCGGACCTGTCrC 



AACGCTTCAAGGTGGTCTTTGCGCCTGTGATCTGCAAGCGGACCTGTCTGAAGG^ 

280 290 300 310 320 330 340 

280 290 300 310 320 330 340 

inputs TTGTCAGCAGGGCTCCAACATGACGCTCATCGGAGAGAACGGCCACAGCAC^^ 



CTGTCAGCAGGGCTCCAACATGAOSCTCATCGGAGAGAACGGCCAC^^ 

350. 360 370 380 390 400 410 

350 360 370 380 390 400 410 

inputs GacrTCCGCGTGGTGGTGTGCCCTCTCCCCTGCATGAATGGCGGCCAQTC 

GCCTTCCGCGTGGTGGTGTGCCCrCTACCCTGCATGAACGGTGGCCA 

420 430 440 450 460 470 460 

420 430 440 450 460 470 480 

inputs TGTGTCCCCCGGACTTCACTGGGCGCTTCTGCCAGGTGCCCGCAGGAGG 



TGTGTCCCCCGGATTTCACGGGGCGCITCTGCCAGGTGCCTGCTQCAGGAACCGGAGC^ 
490 500 510 520 530 540 550 
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540 550 



inputs CTCAGGCCCCGGCCTaAGCAGaACAGGGGCCCTGTCCACAOGGGCGCTGCCGCCCCTO^ 



TTCAGGCCCCGGC-TGGCCCGACC - -GGGCCATGTCCACAGGCCCGCTGCCGCCCCTTGCCCCAGAAGGA 
560 570 580 590 600 610 620 

560 570 580 590 600 610 620 

i npu t s GACTCTGTGGCCAGCAAGCACGCCATCTACGCCGTCCAGGTGATCGCTGACCCTCCTGGGCCCGGGOAGG 

GAGTCTGTGGCTAGCAAACACGCCATTTACGCGGTGCAGGTGATCGCAGATCCTCCCGGGCCGGGG^ 
630 640 650 660 670 680 690 

630 64 0 650 660 670 680 6 90 

inputs GGCCrCCrrGCCCAGCACGCAGCCTTCCTGGTGCCCCTAGGCCCGGGACAGATCTCA 



GTCCTCCTGCACAAOVTGCAGCCrTCTTGGTGCCCCTGGGGCCAGGAC^^ 

700 710 720 730 740 750 760 

700 710 720 730 740 750 760 

inputs CCCGCCCCCCGTGGTGAATGTGCGCGTCCATCACCCGCCCGAGGCCTCAGTCCAGGTGCACCGCATTGi^ 

TCCGCCCCCCGTGGTGAACGTGCGTGTCCATCACCCTCCTGAAGCTTCCGTTCAGGTC 

770 780 790 800 810 820 830 

770 780 790 800 810 820 830 

inputs AGCTCGAACGCCGAGAGCGCAGCCCCCTCCCAGCACCTGCTGCCGCACCCCAAGCCCTCGCACCCTCG<^ 



GGGCCGAACGCTGAAGGCCCAGCCTCTTCCCAGCACTTGCTGCCGCATCCCAA 

840 850 860 870 880 890 900 

840 850 860 870 880 890 900 

inputs CGCCCACCCAGAAGCCCCTCGGCCGCTGCTTTCAGGACACTCTGCCC^ 

CACCCACTCAAAAGCCACTGGGCCGCTGCTTCCAGGACACATTGCCCAAGCAGCCTTGT^ 

910 920 930 940 950 960 970 

910 920 930 940 950 960 970 

inputs CCTCCCCGGCCTCACCAAGCAGGAAGACTGCTGCGGTAGCATCGGCACTGCCTGGGGGCAGAGCAAGTGC 

TTTGCCTGGCCTTACCAAGCAGGAAGATTGCTGCGGTAGCATCGGTACTC 

98a 990 1000 1010 1020 1030 1040 

980 990 1000 1010 1020 1030 1040 

inputs CACAAOTGTCCCCAGCTGCAGTACACAGGAGTGCAGAAGCCAGGGCCTGTACGTGGGGAAGT^ 

CACAAGTGCCCACAGCTTCAGTATACAGGGGTGCAGAAGCCTGTACCTGTACGTGG 

1050 1060 1070 1080 1090 1100 1110 

1050 1060 1070 1080 1090 1100 

inputs ACTGTCCCCAGGGCTACAAQAGGCTTAACAGCACCCACTGCCAGGACATCAACGAGTGCGC^ 

ACTGCCCCCAGGGCTACAAGAGGCTCAACAGCACCCACTGCCAGGATATCAACGAATGTGCGATGCCCGG 
1120 1130 1140 1150 1160 1170 HBO 
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1110 1120 1130 1140 1150 1160 1170 

i npu t S GCGTGTGTCGCCATGGTGACTGCCTCAACAACCCTGGCTCCT^^ 

G AATGTGT - GCCATGGTG ACTGCCTCAACAACCCIXKSCTC^ 

1190 1200 1210 1220 1230 1240 1250 

1180 1190 1200 1210 1220 1230 1240 

i npu t s TTTAGG - CCCCTCCCGTACACAGTGCATTGCAGAC7WVCCGGAGGAGAAGAGCCTGTGTTTCCGCCT 

CTTGGGTCCCCTCGCA - GCACAGTGCATTGCCGACAAACCAGAGGAGAAGAGCCTGTGT^ 

1260 1270 1280 1290 1300 1310 1320 

1250 1260 1270 1280 1290 1300 1310 

inputs GAGCCCTGAGCACCAOTGCCAGCACCCACTGACCACCCGCCTGACCCGCCAGCTCTC 

GAGCACCGAACACCAGTGCCAGCACCCTCIXSACCACACGCCTAACCOT 

1330 1340 1350 1360 1370 1380 1390 

1320 1330 1340 1350 1350 1370 1380 

inputs GGCAAGGCCTGGGGCGCGCGGTGTCAGCGCTGCCCAACAGATGGCACCGCTGCGTTC^ 

GGTAAAGCCTGGGGTGCCCGGTGCCAGCGCTGCCCGGCAGATGGTACAGCAGCCTTC^ 

1400 1410 1420 1430 1440 1450 1460" 

1390 1400 1410 1420 1430 1440 1450 

inputs c-agctgggaagggataccacattctcacctcccaccagacgctca.ccattcagggcgagagt^ 
ccggctgggaaagggtacc^tatcctcacctcccaccagacgctcaccatccaggggga;^ 

1470 1480 1490 1500 1510 1520 1530 

1460 1470 1480 1490 1500 1510 1520 

inputs cccttttcctgcaccctgacgggccacccaagccccagcagcttccggagagccctagccaggct 



ccctcttcctgcaccc - gacgggccacccaaaccccagcagcttcctgaaagccccagccgagcacc^ 

1540 1550 1560 1570 1580 1590 1600 

1530 1540 1550 1560 1570 1580 1590 

i npu t s acctgaggacacag aggaagagagaggggtgaccacggactcaccggtg agtgaggagaggtcagt^ 
cctcgaggacacagaggaagagagagoagtgaccatggatccaccagtgagtgaggagcgat 

1610 1620 1630 1640 1650 1660 1670 

1600 1610 1620 1630 1640 1650 1660 

i npu t s OVGAGCCACCCAACTGCCACCACGACTCCrrcCCCGGCCCTACCCCGAGCTaATC^ 



cagagccaccccactaccaccacctcacccccccggccttacccagagctcatctctcgcccct 

1680 1690 1700 1710 1720 1730 1740 

1670 1680 1690 1700 1710 1720 1730 

i npu t s CGACCATGCGCTGGTTCCTGCCGGACTTGCCTCCITCCCGCAGCGCCGTAaAGATCGCT 



CTACCTTCCACCGGTTCCTGCCAGACTTGCCCCCATCCCGAAGTGCAGTCGAGATCGCCCCCU^ 

1750 1760 1770 1780 1790 1800 1810 



21/60 



WO(H/0(>672 PCT/IJS00/1S184 

1740 1760 1770 1780 ^^1790 1800 

inputs CACAGAGACTCATOAGTaCCGACTGAACCAGAACATCro 

CACAGAGACCGATGAGTGCCGArrGAACCAGAATATCTGTGGCCATGaACAGTGTGTC 

1820 1830 1840 1850 1860 1870 1880 

1810 1820 1830 1840 1850 I860 1870 

i npu t s GACTACTCCTGCCACTGCAACCCCGGCTACCGGTCACATCCCCAGCACCGCT^ 

GATTACTCCTGCCACTGCAACGCTGGCTACCGGTCACACCCGCAGCACCGCrr^ 

1990 1900 1910 1920 1930 1940 1950 

1880 1890 1900 1910 1920 1930 1940 

inputs AGTGCGAGGCAGAGCCCTGTGGCCCGGGGAGCGGCATCTGCATGAACACCGGCGGCTCCTACAATTGCC^ 

AGTGCGAGGCAGAGCCCTGCGGCCCCGGGAAAGGCATCTGTATGAACACTGGTGGCT 

1960 1970 1980 1990 2000 2010 2020 

1950 1960 1970 1980 1990 2000 2010 

inputs CTGCAACCGCGGCTACCGCCTGCACGTGGGCGCCGGGGGGCGCTCGTGCGTGGACCTGAACGAATGCGCC 

CTGCAACCGAGGCTACCGCCTCCACGTGGGTGCAGGGGGCCGCTCGTGCGTGGACCTGAACGAGTGCGCC 
2030 2040 2050 2060 2070 2080 2090 

2 02 0 2 030 2 040 2050 2060 2070 2080 

inputs AAGCCCCACCTGTGCGGCGACGGCGGCTTCTGCATCAACTTTCCCGGTCACTACT^ 

AAGCCTCACCTGTGTGGGGACGGTGGCTTCTGCATCAACTTCCCTGGTCACTACA^ 

2100 2110 2120 2130 2140 2150 2160 

2090 2100 2110 2120 213 0 214 0 2150 

inputs CCGGCTACCGGCTCAAAGCCTCCCGGCCTCCTGTGTGCGAAGACATCGACGAGTGCCGGGACCCAAGCTC 

CTGGCTACCGGCTCAAGGCCTCCCGACCGCCCATTTGCGAAGACATCGACGAGTGTCGCGACCCTAGC^^ 
2170 2180 2190 2200 2210 2220 2230 

2160 2170 2180 2190 2200 2210 2220 

inputs TTGCCCGGATGGCAAATGCGAGAACAAGCCCGGGAGCTTCAAGTC 

CTGCCCTGATGGCAAATGTGAAAACAAACCTGGCAGCTTCA^ 

2240 2250 2260 2270 2280 2290 2300 

2230 2240 2250 2260 2270 2280 2290 

inputs AGCCAGGGGGGCGGGGCCTGTCGCGACGTGAACGAGTGCXSCCGAGGGCAQCCCCTGCTCGCCTG^ 

AGCCAGGGGGGCGGGGCCTGTCGTGATGTCAACGAATGCTCCGAGGGTACCCCCTGCTCT 

2310 2320 2330 2340 2350 2360 2370 

2300 2310 2320 2330 2340 2350 2360 

input 6 GCGAGAACCTCCCGGGCTCCTTCCGCrGCACCTGTGCCCAGGG - CTACGCGCCCGCXSCCCGACGGCCGCA 

GTGAGAACCTTCCGGGTTCTTACCGTTGCACGTGTGCCCAGGOGATACGAACCCGCACAGGACG - CCTCA 
2380 2390 2400 2410 2420 2430 2440 
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2370 2360 2390 2400 2410 2420 2430 

inputs GTTGCTTGGATGTGGACGAGTGTGAGGCTGG<K3ACGTQTGTGACAATC^ 



GTTGCATAGACGTGGATGAGTGTGAGGCTGGGAAAGTGTGCCAAGATGGCATCTGCACGAACACA 

2450 2460 2470 2480 2490 2500 2510 

2440 2450 2460 2470 2480 2490 2500 

inputs ATCTTTCCAGTGTCAGTGCCTCrrCTGGCTACCATCTGTCCAGGGACCGGAGCCACrrGCGAC^ 

CTCTTTCCAGTGTCAGTGCCTCTCCGGCTATCATCTGTCAAGGGATCGGAGCCGCTGTGAGGACATT^ 
2520 2530 2540 2550 2560 2570 2580 

2510 2520 2530 2540 2550 2560 2570 

inputs GAGTGTGACTTCCCTGCAGCCTGCATTGGGGGTGACTGCATCAATACCAATGGCTCCTACAGA 

GAATGTGACTTCCCTCCGGCCTGCATCGGGGGTGACTGCATCAATACCAATGGTTCCTACAGATG 

2590 2600 2610 2620 2630 2640 2650 

2580 2590 2600 2610 2620 2630 2640 

inputs GCCCCCAGGGGCATCGGCTGGTGGGTGGCAGGAAATGCCA AGACATAGATGAGTGCAGCCAGGACCC 

GTCCCCTGGGTCATCGGTTGGTGGGCGGCAGGAAGTGCAAGAAAGATATAGATGAGTGCAGCCAGGACCC 
2660 2670 2680 2690 2700 2710 2720 

2650 2660 2670 2680 2690 2700 2710 

input s GAGCCTGTGCCTTCCCCATGGGGCCTGCAAGAACCTTCAGGGCTCCTATGTGTGTGTCTGCGATGAG^ 



AGGCCTGTGCCTGCCCCATG CCTGCGAGAACCTCCAGGGCTCCTATGTCTGTGTCTGTGATGAGGGT 

2730 2740 2750 2760 2770 2780 

2720 2730 2740 2750 2760 2770 2780 

input s TTCACTCCCACCCAGGACCAGCACGGTTGTGAGGAGGTGGAGCAGCCCCACt^CAAGAA 

TTCACACTCACCCAGGACCAGCATGGGTGTGAGGAGGTGGAGCAGCCCCACCACAAGAAC^ 
2790 2800 2810 2820 2830 2840 2850 

2790 2800 2810 2820 2830 2840 2850 

i np u t s TGAACTTCGATGACACAGTGTTCTGCGACAGCGTATTGGCCACCAACGTGACCCAGCAGGA 

TTAACTTCGATGACACAGTGTTCTGTGACAGCGTATTGGCTACC^ 
2860 2870 2880 2890 2900 2910 2920 

2860 2870 2880 2890 2900 2910 2920 

inputs CTCTCTGGGGGCCGGCTGGGGCGACCACTGCGAAATCTACCCCTGCCCAGTCTACAGCrrCAGCOT 

CTCTCTGGGAGCTGGCTGGGGAGACCACTGCGAAATCTATCCCTGTCCAaTCTA^^ 
2930 2940 2950 2960 2970 2980 2990 

293 0 2 94 0 2950 2960 2 970 2980 2990 

i npu t s CACAGCCTC - TGC CCAGACGGAAAGGGCTACACCCAGGAOU^CAACATCGTCAACTACXKSCATCCC^ 



CACAGCCTGGTGCCTGATGGGAAAAGGCTACACTCAGGACAACAACATTGTGAACTA- 
3000 3010 3020 3030 3040 3050 3060 
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3000 3010 3020 3030 3040 3050 3060 

i npu t s CACCGTGACATCGACGAGTGCATGTTGTTCGGGTCXSG AGATTTGCAAGQAGGGC^ 

CACCGTGACATCGACGAATGCATATTGTTTXXXKSCAGAGATCTGCAAGGAGGGC^ 

3070 3080 3090 3100 3110 3120 3130 

3070 3080 3090 3100 3110 3120 3130 

inputs AGCCTGGCTACGAGTGCTACTGCAAGCAGGGCTTCTACTACGACGGGAACCTGCTGGAATGCGTGGACGT 

AGCCCGGCTACGAGTGCTACTGCAAGCAGGGCTTCTACTACGATGGCAACCTGCTGGAGTGCGTG^ 
3140 3150 3160 3170 3180 3190 3200 

3140 3150 3160 3170 3180 3190 3200 

inputs GGACGAGTGCCTGGACGAGTCCAACTGCCGGAACGORGTGTGTGAGAACACGCGCGGCGGCTACCGCTGT 

GGATGAGTGCTTGGATGAGTCTAACTGCAGGAACGGAGTGTGTGAGAACACACGTGGCGGCTACCGCTC 
3210 3220 3230 3240 3250 3260 3270 

3210 3220 3230 3240 3250 3260 3270 

inputs GCCTCCACGCCCCCTGCCGAGTACAGTCCCGCGCAGCGCCAGTGCCTGAGCCCGGAAGAGATGGACGTGG 



GCCTGCACTCCGCCGGCAGAGTACAGTCCCGCACAGGCCCAGTGTCTGATCCCGGA-GAGATGGA 

3280 3290 3300 3310 3320 3330 3340 

3280 3290 3300 3310 3320 3330 3340 

inputs ACGAGTGCCAGGACCCGGCAGCCTGCCGCCCTGGCCGCTGCGTCAACCTGCCGGGCTCCTACCGCTGCGA 

---GCAC 



3350 3360 3370 3380 3390 3400 3410 

inputs GTGTCGCCCGCCCTGGGTGCCCGGGCCCTCCGGCCGCGATTGCCAGCTCCCCGAGAGCCCGGCCGAGCGT 



3420 3430 3440 3450 3460 3470 3480 

inputs GCCCCGGAGCGGCGCGACGTGTGCIXXSAGCCAGCGCGGAGAGGACGGCATGTGCGCTGGCCCCCTGGCCG 

GCCCCAGAGAGACGTGAAGTGTGCTGGGGCCAGCGAGGAGAGGACGGCATGTGTATGGGGC 

3350 3360 3370 3380 3390 3400 3410 

3490 3500 3510 3520 3530 3540 3550 

inputs GG-CCTGCCCTCACCTTCGACGACTGCTGCTGCCGCCAGGGCCGCGGCTGGGGCGCCCAATGCCGACCGT 

GGACCTGCCCTCACTTTTGATGACTGCTGCTGCCGCCAG - - CCGCGGCTGGGT- ACCCAGTGCAGACCGT 
3420 3430 3440 3450 3460 3470 3480 

3560 3570 3580 3590 3600 3610 3620 

i npu 1 3 GCCCGCCGCGCGGCGCGGGGTCCCATTGCCCGACATCGCAGAGCGAGAGCAATTCCTTCTGGQACAC^ 



GCCCGCCACGTGGCACCGGGTCCCAGTGCCCGACTTCACAGAGTGAGAGCAATTCTTTCTGGGACACAAG 
3490 3500 3510 3520 3530 3540 3550 
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3630 3640 3650 3660 3670 3680 3590 

inputs CCCCCrGCTGTTGGGGAAGCCCCCAAGAGATGAGGACAGTTCAGAGOAGaATTC^^ 

CCCCCTGCTACTGGGGAAGTCTCCGCGAGACGAAGACTlGCrrCAGAGGAaGATTCAGATGAGTC 

3560 3570 3580 3590 3600 3610 3620 

3700 3710 3720 3730 3740 3750 3760 

i npu C s GTGAGTGGCCGCTGCGTGCCGCGGCCGGGCGGCGCCGCGTGCGAGTGTCCCGGCGGCTTCCAGCTCaACG 

GTGAGCGGACGCTGTGTGCCACGGCCAGGCGGGGCGGTATGCGAGTGTCCTGGAGGCTTTCAGCTGGACG 
3630 3640 3650 3660 3670 3680 3690 

3770 3780 3790 3800 3810 3820 3830 

inputs CCTCCCGCGCCCGCTGCGTGGATATCGACGAGTGCCGAGAGCTGAACCAGCGCGGGCTGCTGTGCAAGAG 

CCTCCCGTGCCCGCTGCGTGGACATTGATGAGTGCCGAGAACTGAACCAGCGGGGACTGCTGTGTAAGAG 
3700 3710 3720 3730 3740 3750 3760 

3840 3850 3860 3870 3880 3890 3900 

inputs CGAGCGCTGCGTGAACACCAGCGGCTCCTTCCGCTGCGTCTGCAAAGCCGGCTTCGCGCGCAGCCGCCCG 

CGAGCGGTGCGTGAACACCAGTGGATCCTTCCGCTGTGTCTGCAAAGCTGGCCTCACGC^ 

3770 3780 3790 3800 3810 3820 3830 

3910 3920 3930 3940 3950 3960 

i npu t s CACGGGGCCTGCGTTCCCCAGCGCCGCCGCTGACG CCGCCGACGCC - GC - CCTCGGC - - CCAGACCT 

CACGGG-CCTGCGTGCCTCAGCGCCGCCGCTGATGATGCAGCCATAGCCCACACCTCAGTGATCGATCAT 
3840 3850 3860 3870 3880 3890 3900 

3970 3980 3990 4000 4010 4020 

inputs CGGTGAT CACTGA- -GGGATTTCCG-CGAGCTCGGCCTCA-CTTCTGCCCCGACTTGTGGCTCGG 

CGAGGGTATTTTCACTGAAAGTGG AG ACAG ACAAGTACATCCTTTGCTCCTGACCAAACG AGAG - CATGG 
3910 3920 3930 3940 3950 3960 

4030 4040 4050 4060 4070 4080 4090 

inputs ACCCAGGGA - CCTTCAGGGCCCGCAGA - C - CCTCCCGGCGCCTTGAGACCCGAGGCGCCCCTACCGGC - - 

- ACCCAAGGATCCTTCAGGGCCCACAAATCTCCTTCCCACACCCC - AAACCCAAGGTGCTCCTC 
3970 3980 3990 4000 4010 4020 4030 



inputs 



4100 

_CCC COT CCCC - - - GGTTAGC - 



AGTGCTGTCTGCTTTCTCCCAAGGGTGATTCCrrAGAAACTTCGACATC^ 
4040 4050 4060 4070 4080 4090 4100 

4110 4120 4130 4140 
inputs GGGCGGTTGTAAGGTC- - -TCCG- -GCGGGC-GCTGC- -CTGCCT TC-- 

CTTGGCTTTCAAGGCAAATTGATATTCACATCCAAAGCGGGCAGCATC^ 
4110 4120 4130 4140 4150 4160 4170 
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4150 4160 4170 4180 4190 
inputs - -CT CCCAGAGGGTGT TTCCTAGAAACT - -GATAAA TCAGATCQTGCCT - - -CTT 

AQCTGGGACCCAGGATGTGAAATAGAATTTATTGTGGCTCTGATTATGTACACTAGATGTGCCTGAC 
4180 4190 4200 4210 4220 4230 4240 

4200 4210 4220 
i npu C s - TAA AAAAAAAAAAAAAAAAAAGGGCGGCCGC 



CTGACCAGGCTCACATGGTTTGTACAATAAATACATCCGCCGGGAAAAAAAAAAAAAAAAAAAA 
4250 4260 4270 4280 4290 4300 4310 
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ALIGN calculates ^^globai alignment of two sequences^^ 
verflion 2,0uPlease cite: Myers and Miller, CABIOS (1989) 

> Patent Protein R79475 - (untitled) 1251 aa vs. 

> hT275 a. a. 1289 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

82.8% identity; Global alignment score: 5S80 



10 20 30 40 50 



60 



inputs MRQAA LGLLALLLLALLGPGGRGVGRPGS- -GAQAGAGRWAQRFKVVFAPVICKRTCLKGQCRDSC 

NlRGAGAAGLIJa.LLLLLLLLLGU3GRVEGGPAGERGAGGGGAIiARERFKVVrAPVICKRTCL 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs QOGS>mTLIGENGHSTDTLTGSAFRVWCPLPCmGGQCSSRNQCLCPPDFTGRFCQVPAAGTGAGTGSS 

QC^SNMTLIGENGHSTDTLTGSGFRVVVCPLPCMNGGOCSSRNQCLCPPDFTGRFCQVPAGGAGGGTGGS 
80 90 100 110 120 130 140 

140 150 160 170 180 190 200 

inputs GPG-WPDRAMSTGPLPPI^EGESVASKHAIYAVQVIADPPGPGEGPPAQHAAFLVPLGPGOISAEVQAP 

GPGLSRTGALSTGALPPUIPEGDSVASKHAIYAVQ\^ADPPGPGEGPPAQHAAFLVTIX3PG0ISAE^ 

150 160 170 180 190 200 210 

210 220 230 240 250 260 270 

inputs PPVVNVRVHHPPEASVQVTOIEGPNAEGPASSQHLLPHPIO'QHPRPPTQKPI/SRCFQDTLPKOPCG^ 

PPWNVRVHHPPEASVQVHRIESSNAESAAPSQRLLPHPKPSHPRPPTQKPLGRCFQDTLPKQPCGSNPL 
220 230 240 250 260 270 280 

280 290 300 310 320 330 340 

inputs PGLTKQEDCCGSIGTAWGQSKCHKCPQLQYTGVQKPVPVRGEVGADCPQGYKRLNSTHCQDIKECAMPGN 

PGLTKQEDCCGSIGTAWGQSKCHKCPQLQrrGVQKi^GPVTlGEVGADCPQGYKRI^STHCQDINEC^ 

290 300 310 320 330 340 350 

350 360 370 380 390 400 410 

nputs VCHGDCLNNPGSYRCVCPPGHSLGPIJUVQCIADKPEEKSLCFRLVSTEHQCQHPLTT^ 

CRHGDCLNNPGSYRCVCPPGHSIXSPSRTQCIADKPEEKSLCFRLVSPEHQCQHPLTTRLTRQL^ 

360 370 380 390 400 410 420 

420 430 440 450 460 470 480 

nputs AWGARCQRCPADGTAAFKEICPGWE- -RVPYPHLPPDAHHPGGKRLLPLPAPDGPPKPQQLPESPSRAPP 

AWGARCQRCPTDGTAAFKEICPAGKGYHILTSHQTLTIQGESDFSLF-LH-PDOPPKPQQLPESPSQAPP 
430 440 450 460 470 480 

490 500 510 520 530 540 550 

nputs LEDTEEERGVTMDPPVSEERSVQQSHPTTTTSPPRPYPELISRPSPPTFHRFLPDLPPSRSAVEIAPTQV 

PEDTEEERGVTTDSPVSEERSVQQSHPTATTTPARPYPELISRPSPPTMRWFLPDLPPSRSAVEIAPTQV 
490 500 510 520 530 540 550 
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560 570 580 590 600 610 620 

inputs TETDECRLNQNICGHGQCVPGPSDYSCHCNAGYRSHPQHRYC7DVNECEAEPTO 



TETDECRI^QNICGHGECVPGPPDYSCHCNPGYRSHPQHRYCrTOVNECEAEPCGPGRGICM^^^ 
560 570 580 590 600 610 620 

630 640 650 660 670 680 690 

inputs CNKGYRIiHVGAGGRSCVDLNECAKPHLCGDGGFCINFPGHYKCNCYPGYIU.K^ 



CWGYRUIVGAGGRSCVDLNECAKPflLCGDGGFCINFPGHYKCNCYPGYTajKASRPPVCEDIDECRDP 
630 640 650 660 670 680 690 

700 710 720 730 740 750 760 

inputs CPDGKCENKPGSFKCIACQPGYRSQGGGACRDVNECSEGTPCSPGWCEKLPGSYRCTCAQG- IRTRTGRL 



CPIXSKCENKPGSFKCIACQPGYRSQGGGACRDVNECAEGSPCSPGWCENLPGSFRCTCAQGYAPAPD^ 
700 710 720 730 . 740 750 760 

770 7B0 790 800 810 820 830 

inputs SCIDVDDCEAGCTCQDGICTOTPGSFQCQCLSGYHLSRTRSRCEDIDECDFPAACIGGDCINTNGSYRCL 



SCI^VDECEAGDVCDNGICSNTPGSFQCQCLSGYHLSRDRSHCEDIDECDFPAACIGGDCINTNGSYRCL 
770 780 790 800 810 820 830 

840 850 860 870 880 890 

inputs CPLGHRLVGGRKCKKDIDECSQDPGLCLPH-ACENLCXSSYVCVCDEGFTLTQDQHGCEEVEQPHHK^ 

CPQGHRLVGGRKCQ-DIDECSQDPSLCLPHGACKOT.OGSYVCVCDEGFTPTQDQHGCEEVEQPHHKKECY 
840 850 860 870 880 890 900 

900 910 920 930 940 950 960 

inputs U4FDDTVTCDSVIATNVTQOECCCSIX2AGWGDHCEIYPCPVYSSAEFHSLVPIX3KR^ 

UIFDDTVTCDSVIATNVTQQECCCSIX^AGWGDHCEIYPCPVYSSAEFHSLCPDGKGYTQDN^ 
910 920 930 940 950 960 970 

970 980 990 1000 1010 1020 1030 

inputs HRJDIDECILFGAEICKEGKCVNSQPGYECYCKQGFYYDGm.LECVDVDECIJ)ESNCOT 

HRDIDEC^^JFGSEICKEGKCVNTQPGYECYCKQGFY^G^^JLECV^ 
980 990 1000 1010 1020 1O30 1040 

1040 1050 1060 
inputs ACTPPAEYSPAQAQCLSPEEM --EK 

ACTPPAEYSPAQRQCLSPEEMDVDECQDPAACllPGRCVKLPGSYRCECRPPWVPGPSGRDCQLPESPAER 
1050 1060 1070 1080 1090 1100 1110 

1070 1080 1090 1100 1110 1120 1130 

inputs APERREVCWGQRQEIXSMCMGPIiAGPALTFDDCCCRQPRL-GYQCRPCPPRGTGSQCPTSQSESNSFWDTS 



APERRDVCWSQRGEDGMCAGPLAGPALTFDDCCCRQGRGWGAQCRPCPPRGAGSHCPTSQSESNSFVTDTS 
1120 1130 1140 1150 1160 1170 1180 
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1140 IISO 1160 1170 1180 1190 X200 

inputs PLLIiGKSPRDEDSSEEDSDECRCVSGPa^RPGGAVCECPGGFQLDASRARCVDIDECREIiNQRG^ 

PLLLGKPPRDEDSSEEDSDECRCVSGRCVPRPGGAACECPGGFQLDASRARCVDIDECREI^^ 
1190 1200 1210 1220 1230 1240 1250 

1210 1220 1230 1240 1250 

inputs ERCVNTSGSFRCVCKAGFTRSRPHGPACLSAAADDAAIAHTSVIDHRGYFH 

ERCVNTSGSFRCVCKAGFARSRPHG -ACVP QRR R 

1260 1270 1280 
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cctcctgctg tcccctccct acccttggct tctcgccccg ctctgccctc tgctaccaac 
60 

actcgatccc ctgctcgggc tcgacctcca atctccgagg gtcgtgcggc cccggatgcc 
120 

cgggccccga gcggtgccca cggcctggcc cctgcg atg cgc cag gcc ggc gga 
174 

Met Arg Gin Ala Gly Gly 
1 5 

ttg ggg ctg ctg gca eta etc ctg ctg gcg ctg ctg ggc ccc ggc ggc 
222 

Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu Leu Gly Pro Gly Gly 

10 15 20 

cga ggg gtg ggc egg ccg ggc age ggg gca cag gcg ggg gcg ggg cgc 
270 

Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin Ala Gly Ala Gly Arg 
25 30 35 

tgg gcc caa cgc ttc aag gtg gtc ttt gcg cct gtg ate tgc aag egg 

318 

Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro Val lie Cys Lys Arg 
40 45 50 

ace tgt ctg aag ggc cag tgt egg gac age tgt cag cag ggc tec aac 
366 

Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys Gin Gin Gly Ser Asn 

55 60 65 70 

atg aeg etc ate gga gag aac ggc cae age ace gac acg etc ace ggt 
414 

Met Thr Leu He Gly Glu Asn Gly His Ser Thr Asp Thr Leu Thr Gly 

75 80 85 

tct gcc ttc cgc gtg gtg gtg tgc cct eta ccc tgc atg aac ggt ggc 
462 

Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro Cys Met Asn Gly Gly 
90 95 100 

cag tgc tct tec cga aac cag tgc ctg tgt ccc ccg gat ttc aeg ggg 
510 

Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro Pro Asp Phe Thr Gly 
105 110 115 

cgc ttc tgc cag gtg cct get gca gga ace gga get ggc ace ggg agt 
558 

Arg Phe Cys Gin Val Pro Ala Ala Gly Thr Gly Ala Gly Thr Gly Ser 
120 125 130 

tea ggc ccc ggc tgg ccc gac egg gcc atg tec aca ggc ccg ctg ccg 
606 

Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser Thr Gly Pro Leu Pro 

135 140 145 150 

ccc ctt gcc cca gaa gga gag tct gtg get age aaa cac gcc att tac 
654 

Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser Lys His Ala lie Tyr 
155 160 165 

9cg gtg cag gtg ate gca gat cct ccc ggg ccg ggg gag ggt cct cct 
702 

Ala Val Gin Val He Ala Asp Pro Pro Gly Pro Gly Glu Gly Pro Pro 
170 175 180 
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gca caa cat gca gcc ttc ttg gtg ccc ctg ggg cca gga caa ate teg 
750 

Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly Pro Gly Gin lie Ser 

185 190 195 

gca gaa gtg cag get ccg ccc ecc gtg gtg aac gtg cgt gtc cat cac 
798 

Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn Val Arg Val His His 

200 205 210 

cct cct gaa get tcc gtt cag gtg cac cgc ate gag ggg ccg aac get 
846 

Pro Pro Glu Ala Ser Val Gin Val His Arg lie Glu Gly Pro Asn Ala 

215 220 225 230 

gaa ggc cca gcc tct tcc cag cac ttg ctg ccg cat ccc aag ccc ccg 
894 

Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro His Pro Lys Pro Pro 
235 240 245 

cac ccg agg cca ecc act caa aag cca ctg ggc cgc tgc ttc cag gac 
942 

His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly Arg Cys Phe Gin Asp 

250 255 260 

aca ttg ccc aag cag cct tgt ggc age aac cct ttg cct ggc ctt acc 
990 

Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro Leu Pro Gly Leu Thr 

265 270 27S 

aag cag gaa gat tgc tgc ggt age ate ggt act gcc tgg gga caa age 
1038 

Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr Ala Trp Gly Gin Ser 
280 285 290 

aag tgt cac aag tgc cca cag ctt cag tat aca ggg gtg cag aag cct 
1086 

Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr Gly Val Gin Lys Pro 

295 300 305 310 

gta cct gta cgt ggg gag gtg ggt get gac tgc ccc cag ggc tac aag 
1134 

Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys Pro Gin Gly Tyr Lys 
315 320 325 



335 340 



350 355 



365 370 



375 380 385 390 



agg 


etc 


aac 


age 


1182 








Arg 


Leu 


Asn 


Ser 








330 


ggg 


aat 


gtg 


tgc 


1230 








Gly 


Asn 


Val 


Cys 






345 




tgt 


gtc 


tgc 


ccg 


1278 








Cys 


Val 


Cys 


Pro 




360 






att 


gcc 


gac 


aaa 


1326 








He 


Ala 


Asp 


Lys 
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• 9 

acc gaa cac cag tgc cag cac cct ctg acc aca cgc eta acc cgc cag 

^Thr Glu His Gin Cys Gin His Pro Leu Thr Thr Arg Leu Thr Arg Gin 
395 400 405 

etc tgc tgc tgt agt gtg ggt aaa gcc tgg ggt gcc egg tgc cag cgc 

^Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly Ala Arg Cys Gin Arg 
410 415 420 

tgc ccg gca gat ggt aca gca gcc ttc aag gag ate tgc ccc ggc tgg 

^Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu He Cys Pro Gly Trp 
425 430 435 

gaa agg gta cca tat cct cac etc cca cca gac get cac cat cca ggg 

^Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp Ala His His Pro Gly 
440 445 450 

gga aag ega ctt etc cct ctt cct gca ccc gac ggg eca ccc aaa ccc 

^Glv Lys Arq Leu Leu Pro Leu Pro Ala Pro Asp Gly Pro Pro Lys Pro 
455 460 465 470 

cag cag ctt cct gaa age ccc age cga gca cca ccc etc gag gac aca 

^Gln Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro Pro Leu Glu Asp Thr 
475 480 485 

gag gaa gag aga gga gtg acc atg gat cca cca gtg agt gag gag cga 

^Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro Val Ser Glu Glu Arg 
490 495 500 

teg gtg cag cag age cac ccc act acc acc acc tea ccc ccc egg cct 

^Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr Ser Pro Pro Arg Pro 
505 510 515 

tac cca gag etc ate tet cgc ccc tec cca cct acc ttc cac egg ttc 

^Tyr Pro Glu Leu He Ser Arg Pro Ser Pro Pro Thr Phe His Arg Phe 
520 525 530 

ctg eca gac ttg ccc cca tec ega agt gca gtg gag ate gee ccc act 

^Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val Glu He Ala Pro Thr 
535 540 545 

cag gte aca gag ace gat gag tgc cga ttg aac cag aat ate tgt ggc 

^Gln Val Thr Glu Thr Asp Glu Cys Arg Leu Asn Gin Asn He Cys Gly 
555 560 565 

cat gga cag tgt gtg cct ggc ccc teg gat tac tec tgc cac tgc aac 
1902 

His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr Ser Cys His Cys Asn 
570 575 580 

get ggc tac egg tea cac ccg cag cac cgc tac tgt gtt gat gtg aac 

^Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr Cys Val Asp Val Asn 
585 590 595 
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gag tgc gag gca gag ccc tgc ggc ccc ggg aaa ggc ate tgt atg aac 
1998 

Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys Gly He Cys Met Asn 

600 605 610 

act ggt ggc tec tac aat tgt cac tgc aac cga ggc tac cgc etc cac 
2046 

Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg Gly Tyr Arg Leu His 

615 620 625 630 

gtg ggt gca ggg ggc cgc teg tgc gtg gac ctg aac gag tgc gee aag 
2094 

Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu Asn Glu Cys Ala Lys 

635 640 645 

cell cac ctg tgt ggg gac ggt ggc ttc tgc ate aac ttc cct ggt cac 
2142 

Pro His Leu Cys Gly Asp Gly Gly Phe Cys He Asn Phe Pro Gly His 

650 655 660 

tac aaa tgc aac tgc tat cct ggc tac egg etc aag gee tec cga ecg 

2190 

Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu Lys Ala Ser Arg Pro 

665 670 675 

ccc att tgc gaa gac ate gac gag tgt cgc gac cct age acc tgc cct 

2238 

Pro He Cys Glu Asp He Asp Glu Cys Arg Asp Pro Ser Thr Cys Pro 

680 €85 690 

gat ggc aaa tgt gaa aac aaa cct ggc age ttc aag tgc ate gee tgc 
2286 

Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe Lys Cys He Ala Cys 

695 700 705 710 

cag cct ggc tac cgt age cag ggg ggc ggg gcc tgt cgt gat gtc aac 
2334 

Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala Cys Arg Asp Val Asn 
715 720 725 

gaa tgc tec gag ggt acc ccc tgc tct cct gga tgg tgt gag aac ctt 
2382 

Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly Trp Cys Glu Asn Leu 
730 735 740 

ccg ggt tct tac cgt tgc acg tgt gcc cag ggg ata cga acc cgc aca 
2430 

Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly He Arg Thr Arg Thr 
745 750 755 

gga cgc etc agt tgc ata gac gtg gat gag tgt gag get ggg aaa gtg 
2478 

Gly Arg Leu Ser Cys He Asp Val Asp Glu Cys Glu Ala Gly Lys Val 
760 765 770 

tgc caa gat ggc ate tgc acg aac aca cca ggc tct ttc cag tgt cag 
2526 

Cvs Gin Asp Gly He Cys Thr Asn Thr Pro Gly Ser Phe Gin Cys Gin 

775 780 785 790 

tgc etc tee ggc tat cat ctg tea agg gat egg age cgc tgt gag gac 
2574 

Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg Ser Arg Cys Glu Asp 

795 800 805 
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att gat gaa tgt gac ttc cct gcg gcc tgc ate ggg ggt gac tgc ate 
2622 

lie Asp Glu Cys Asp Phe Pro Ala Ala Cys lie Gly Gly Asp Cys lie 
810 815 820 

aat aec aat ggt tec tae aga tgt etc tgt ccc etg ggt eat egg ttg 
2670 

Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro Leu Gly His Arg Leu 
825 830 835 

gtg ggc ggc agg aag tgc aag aaa gat ata gat gag tgc age cag gac 
2718 

Val Gly Gly Arg Lys Cys Lys Lys Asp lie Asp Glu Cys Ser Gin Asp 
840 845 850 

cca ggc ctg tgc ctg ccc cat gcc tgc gag aac etc cag ggc tec tat 
2766 

Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn Leu Gin Gly Ser Tyr 

855 860 865 870 

gtc tgt gtc tgt gat gag ggt ttc aca etc ace cag gac cag cat ggg 
2814 

Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr Gin Asp Gin His Gly 
875 880 885 

tgt gag gag gtg gag cag ccc cac cac aag aag gag tgc tac ctt aac 
:>862 

Cys Glu Glu Val Glu Gin Pro His His Lys Lys Glu Cys Tyr Leu Asn 

890 895 900 

ttc gat gac aca gtg ttc tgt gac age gta ttg get acc aat gtc act 
2910 

Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu Ala Thr Asn Val Thr 

905 910 915 

cag cag gaa tgc tgt tgc tct ctg gga get ggc tgg gga gac cac tgc 
2958 

Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly Trp Gly Asp His Cys 

920 925 930 

gaa ate tat ccc tgt cca gtc tac age tea gcc gaa ttt cac age ctg 
3006 

Glu lie Tyr Pro Cys Pro Val Tyr Ser Ser Ala Glu Phe His Ser Leu 

935 940 945 950 

gtg cct gat ggg aaa agg eta cac tea gga caa eaa cat tgt gaa eta 
3054 

Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin Gin His Cys Glu Leu 

955 960 965 

tgc att cct gcc cac cgt gac ate gac gaa tgc ata ttg ttt ggg gca 
3102 

Cys He Pro Ala His Arg Asp He Asp Glu Cys He Leu Phe Gly Ala 
970 975 980 

gag ate tgc aag gag ggc aag tgt gtg aac acg cag ccc ggc tae gag 
3150 

Glu He Cys Lys Glu Gly Lys Cys Val Asn Thr Gin Pro Gly Tyr Glu 

985 990 995 

tgc tac tgc aag cag ggc ttc tac tac gat ggc aac ctg ctg gag tgc 
3198 

Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly Asn Leu Leu Glu Cys 
1000 1005 1010 
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gtg gac gtg gat gag tgc ttg gat gag tct aac tgc agg aac gga gtg 
3246 

Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn Cys Arg Asn Gly Val 

015 1020 1025 1030 

tgt gag aac aca cgt ggc ggc tac cgc tgt gcc tgc act ccg ccg gca 
3294 

Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys Ala Cys Thr Pro Pro Ala 
1035 1040 1045 

gag tac agt ccc gca cag gcc cag tgt ctg ate ccg gag aga tgg age 
3342 

Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu lie Pro Glu Arg Trp Ser 
1050 1055 1060 

acg ccc cag aga gac gtg aag tgt get ggg gcc age gag gag agg acg 
3390 

Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser Glu Glu Arg Thr 
1065 1070 1075 

gca tgt gta tgg ggc ccc tgg geg gga cet gcc etc act ttt gat gac 
3438 

Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu Thr Phe Asp Asp 
1080 1085 1090 

tgc tgc tgc cgc cag ccg egg ctg ggt ace cag tgc aga ccg tgc ccg 
3486 

Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys Arg Pro Cys Pro 

095 1100 1105 1110 

cca cgt ggc acc ggg tec cag tgc ccg act tea cag agt gag age aat 
3534 

Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin Ser Glu Ser Asn 
1115 1120 1125 

tct tte tgg gac aca age ccc ctg eta ctg ggg aag tct ccg cga gac 
3582 

Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys Ser Pro Arg Asp 
1130 1135 1140 

gaa gac age tea gag gag gat tea gat gag tgc cgt tgt gtg age gga 
3630 

Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg Cys Val Ser Gly 

1145 1150 1155 

cgc tgt gtg cca egg cca ggc ggg geg gta tgc gag tgt cet gga ggc 
3678 - 

Arg Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu Cys Pro Gly Gly 
1160 1165 1170 

ttt cag ctg gac gee tec cgt gcc cgc tgc gtg gac att gat gag tgc 
3726 

Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp lie Asp Glu Cys 

175 1180 1185 1190 

cga gaa ctg aac cag egg gga ctg ctg tgt aag age gag egg tgc gtg 
3774 

Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser Glu Arg Cys Val 
1195 1200 1205 

aac acc agt gga tec tte cgc tgt gtc tgc aaa get ggc ttc acg cgc 
3822 

Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala Gly Phe Thr Arg 
1210 1215 1220 
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3870 

Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala Ala Asp Asp Ala 
1225 1230 1235 

gcc ata gcc cac acc tea gtg ate gat cat cga ggg tat ttt cac 
3915 

Ala He Ala His Thr Ser Val He Asp His Arg Gly Tyr Phe His 
1240 1245 1250 

tgaaagtgga gacagacaag tacatccttt gctcctgacc aaacgagagc atciqacccaa 
3975 

ggatccttca gggcccacaa atctccttcc cacaccccaa acccaaggtg ctcctgtctg 
4 03 5 

cagagtgctg tctgctttct cccaagggtg attcctagaa acttcgacat caqatctqcc 
4095 ^ y 

cctttaattt actcttggct ttcaaggcaa attgatattc acatccaaag cqqqcaqcat 
4155 

caactgcttg gcgggttgga ctgagctggg acccaggatg tgaaatagaa tttattqtqq 
4215 ^ 
.ctctgattat gtacactaga tgtgcctgac ctgctgacca ggctcacatg gtttgtacaa 
4275 

taaatacatc cgccgggaaa aaaaaaaaaa aaaaaaaaaa a 

1 
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laput file T300At:hX672i5; Output File T3 00AChX672i5 .pac 
Sequence length 133 2 

MASLGLLLLLI. - 12 

CCAAGAATTCGGCACGAGGAGAGGCCGGCC ATG GCC AGC CTG GGG CTG CTG CTC CTG CTC TTA CTG 6 6 

ACA GCA CTG CCA CCG CTG TGG TCC TCC TCA CTG CCT GGG CTG GAC ACT GCT GA\ AGT 12 6 

^^^X^^^DLILSALZRATVF:. EQ 52 

GCC ACC ATT GCA GAC CTG ATC CTG TCT GCG CTG GAG AGA GCC ACC GTC TTC CTA GAA GAG 18 6 

pLpCiNLDGMVGUKVLEEQ^ 72 

AGG CTG CCT GAA ATC AAC CTG GAT GGC ATG GTG GGG GTC CGA CTG CTG G?J^ GAG CAG CTA 246 

j^5VR£KWAQE?LLQ?LSL R'/ 92 

AAA AGT GTC CGG GkG AAG TGG GCC CAG GAG CCC CTG CTG CAA CCG CTG AGC CTG CGC GTG 3 0 6 

GMLGEKLEAAIQJ^SLHYLKL 112 

GGG ATG CTG GGG GAG Ai^.G CTG GAG GCT GCC ATC CAG AGA TCC CTC CAC TAC CTC AAG CTG 366 

Si3.>KYLHE^Q~-^TLQPGFWKC. 132 

AGT GAT CCC AAG TAC CTA AGA GAG TTC CAG CTG ACC CTC CAG CCC GGG TTT TGG AAG CTC 42 6 

pHAWIHTDASLVYPTFCrgu 152 

CCA CAT GCC TGG ATC CAC ACT GAT GCC TCC TTG GTG TAC CCC ACG TTC GGG CCC CAG GAC 436 

2PS^h:ksdvclvqllgtc;td 172 

TCA TTC TCA GAG GAG AGA AGT GAC GTG TGC CTG GTG CAG CTG CTG GGA ACC GGG ACG GAG 546 



gSZpCGLSDLCRSLMTKi' ^ 
ACC AGC GAG CCC TGC GGC CTC TCA GAC CTC TGC AGG AGC CTC ATG ACC AAG CGC GGC .GC 

SGYCLSKQLLF7LWARK -AGC 
TCA GC-C TAC TGC CTG TCC CAC CAA CTG CTC TTC TTC CTC TGG GCC AGA ATG AGG GC-G TGC 

tqgfloqso:^y:ni.fca txm 

ACA CAG GGA CCA CTC CAA CAG AGC CAG GAC TAT ATC AAC CTC TTC TGC GCC A\C ATG ATG 

D L N R K A E A I G Y A y P T R D C F M 
GAC TTG AAC CGC AGA GCT GAG GCC ATC GGA TAC GCC TAC CCT ACC CGG GAC AFC TTC ATG 

Ej^I, HFCGNGGFSDFYKI.RW:. 272 
GAA AAC ATC ATG TTC TGT CGA ATG GGC GGC TTC TCC GAC TTC TAC AAG CTC CGG TGG CTG E46 



^^I^LsWQKQQEGCFGEPD/-^ 
GAG CCC ATT CTC AGC TOG CAG AAA CAG CAQ GAA GGA TGC TTC GGG GAG CCT GAT GCT GAA 



192 
606 

212 

666 

232 

726 

252 
786 



292 
906 

DESLSKAIQYQOHFSRRVKR 312 
CAT GAA GAA TTA TCT AAA GCT ATT CAA TAT CAG CAG CAT TTT TCG AGC AGA GTG AAG AGG 9 66 

R ^ K Q F P D G C S S W T A T A V A A 332 

CGA GAA AAA CAA TTT CCA GAT GGC TGC TCC TCC CAC AAC ACA GCC ACA GCA CTG GCA GCC 102 6 

r,GG?LYlL^^Y?PAN^l=^P>iP 



CTG GGT 



GGC TTC CTA TAC ATC CTG GCA GAA TAC CCC CCA GCA AAC AGA GAG C:CA CAC CCA 108 6 
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T?C A?A CCG C?A CCA AGC AGC CGC 

O.C^ACC3rrCCAX.CCACC.CCC.CaAOaACOAACAC.CCCC^O^.X^ 
OCCCAC^OTAOOTGGCaGCCCaC^OCK^CCCAaaTOTCAACGGATaAATAAAGTTC.^ 
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mouse TANGO 300 sequence 

gtcgacccac gcgtccgcat ccaccagcag aaatcctgtc atg gcg aga etc ggg 55 

Met Ala Arg Leu Gly 
1 5 

ctg ctt etc etc ctg ctg ctg gcc ctg cca cca cac ttc tec tea gtg 
103 

Leu Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro His Phe Ser Ser Val 
10 15 20 

tea tgg eca gae act gea cag ggc acc atg gca aac ttg ate ctg act 
151 

Ser Trp Pro Asp Thr Ala Gin Gly Thr Met Ala Asn Leu He Leu Thr 

25 30 35 

gea tta gaa aaa gcc acc ttg ttc ttg gag gac agg ctg ccc aca ate 
199 

Ala Leu Glu Lys Ala Thr Leu Phe Leu Glu Asp Arg Leu Pro Thr He 

40 45 50 

aac ctg gat ggt gtg gtg ggc ttc caa gtg ctg gaa gtg caa etc cga 
247 

Asn Leu Asp Gly Val Val Gly Phe Gin Val Leu Glu Val Gin Leu Arg 

55 60 65 

gga gtt cag gaa aaa tgg get cac aag ccc ttg ctg cag ect etc age 
295 

Gly Val Gin Glu Lys Trp Ala His Lys Pro Leu Leu Gin Pro Leu Ser 

70 75 80 8B 

atg cgc get gga cag atg gcc aac aca ctg tct get etc etc caa aaa 
343 

Met Arg Ala Gly Gin Met Ala Asn Thr Leu Ser Ala Leu Leu Gin Lys 
90 95 100 

tec ate ttc tac etc aag cag agt gac ccc acg tac eta aga gag ttc 
391 

Ser lie Phe Tyr Leu Lys Gin Ser Asp Pro Thr Tyr Leu Arg Glu Phe 
105 110 115 

cag cca age att cag ect ggg ttt tgg aag ttg ccc aat gac tgg aca 
439 

Gin Pro Ser He Gin Pro Gly Phe Trp Lys Leu Pro Asn Asp Trp Thr 

120 125 130 

cgc ace aat gcc tec eta gtc tac ccc tgg ctg gaa ccc ctg gac tct 
487 

Arg Thr Asn Ala Ser Leu Val Tyr Pro Trp Leu Glu Pro Leu Asp Ser 
135 140 145 

ttc tea gag gaa age age gat gtg tgc ctg gtg caa eta eta gga aca 
535 

Phe Ser Glu Glu Ser Ser Asp Val Cys Leu Val Gin Leu Leu Gly Thr 
150 155 160 165 

ggg aca gac age age cag ect tgc agg etc tec aac ttc tgc aga acc 
583 

Gly Thr Asp Ser Ser Gin Pro Cys Arg Leu Ser Asn Phe Cys Arg Thr 
170 175 180 

ctt atg acc aag gcc ggc tgc tea ggc tac age etc tec cat cag ctg 
631 

Leu Met Thr Lys Ala Gly Cys Ser Gly Tyr Ser Leu Ser His Gin Leu 
185 190 195 
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etc ttc ttc etc tgg gcc aga atg caa ggg tgc acg gag gga ccg Lcc 
679 

Leu Phe Phe Leu Trp Ala Arg Met Gin Gly Cys Thr Glu Gly Leu Phe 
200 205 210 

etc cag age caa cac tac atg gac ate ttc tgt gcc aat atg atg gaa 
727 

Leu Gin Ser Gin His Tyr Met Asp lie Phe Cys Ala Asn Met Met Glu 

215 220 225 

ctg aac cae aga get gag gcc gtt gga tac get tac ccc acc caa gac 
775 

Leu Asn His Arg Ala Glu Ala Val Gly Tyr Ala Tyr Pro Thr Gin Asp 

230 235 240 245 

etc ttc atg gaa aac att atg ttc tgt ggt atg get ggc ttc tet gac 
823 

Leu Phe Met Glu Asn lie Met Phe Cys Gly Met Ala Gly Phe Ser Asp 
250 255 260 

ttc tac aag ctg cgc tgg ctg gag gcc att etc age tgg cag aac ccc 
871 

Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu Ser Trp Gin Asn Pro 
265 270 275 

cag gtg gga tgc ttc ggg agg cct gac aca aag ggt gaa cct tct gaa 
919 

Gin Val Gly Cys Phe Gly Arg Pro Asp Thr Lys Gly Glu Pro Ser Glu 
280 285 290 

gtt cca cat cag cag ggc att ctg aga aga gtg cga agg egg gaa aaa 
967 

Val Pro His Gin Gin Gly He Leu Arg Arg Val Arg Arg Arg Glu Lys 

295 300 305 

ctg ttc gca gat ggc tgt teg tgc cac aac aca gcc aca gea gte gca 
1015 

Leu Phe Ala Asp Gly Cys Ser Cys His Asn Thr Ala Thr Ala Val Ala 

310 315 320 325 

gcc ctg ggt ggc ttt etc tac ate ctg gca gaa tac cac eca gac aat 
1063 

Ala Leu Gly Gly Phe Leu Tyr He Leu Ala Glu Tyr His Pro Asp Asn 
330 335 340 

gga gat gca cat eca gaa tac tac cca aac cat gga gat eca tac tea 
1111 

Gly Asp Ala His Pro Glu Tyr Tyr Pro Asn His Gly Asp Pro Tyr Ser 
345 350 355 

tec tea cag tea eca gea age aac tac caa gat ggt get gee ggc eet 
1159 

Ser Ser Gin Ser Pro Ala Ser Asn Tyr Gin Asp Gly Ala Ala Gly Pro 
360 365 370 

gac gtc eag agg act ggc agg ccc ctt agt gtt tct taagtcctga 
1205 

Asp Val Gin Arg Thr Gly Arg Pro Leu Ser Val Ser 
375 380 385 

gteagaggte aeaggctgag gaggcaattg aggaaagtga ccagctatat ccceategee 
1265 

acttctgggt gtttaaaagt ettgggagag cagggecagg gaaagcaggg ttggagagtg 
1325 
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gggtggccca gatgtcagca gaatacataa agcacagtca attggagctg aaaaaaaaaa 
1385 

aaaaagggcg gccgc 
1400 
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1 , [ . I , I . I . I . 1 . I . 1 . I i I • I ' I ' I ' I • I ' t ' r I ' I 

1 41 81 121 161 m 241 281 321 3B1 
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GAP of: Fr<3cgManager_686_DBF0L.To2 check: 4995 from: 1 to: 1083 
hT300 ORF (analysis only) - Import - complete 

to: FrGcgManager_686_EBF03Uvti check: 4265 from: 1 to: 1155 
mT300 ORF (analysis only) - Import - complete 



Gap Weight: 
Length Weight: 

Quality 
Ratio 

Percent Similarity 



12 Average Match: 10.000 

4 Average Mismatch: 0.000 



8110 Length 
7.488 Gaps 
77.726 Percent Identity 



1174 
7 

77 . 726 



IjU/^A'V 1 ATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACC 50 

Mill II II IIIIIIIMMII II I III II llllllll 

/YVUW^t- ^ ATGGCGAGACTCGGGCTGCTTCTCCTCCTGCTGCTG. . . GCCCTGCCACC 4 7 
51 GCTGTGGTCCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCA 10 0 

I I lllllll II I IN llllllll III I II 

4 8 AC. . .ACTTCTCCTCAGTGTCATGGCCAGACACTGC. .ACAG. . . .GGCA 8 8 
101 CCATTGCAGACCTGATCCTGTCTGCGCTGGAGAGAGCCACCGTCTTCCTA 150 

nil III II llllllll INI I II I lllllll I III I 

8 9 CCATGGCAAACTTGATCCTGACTGCATTAGAAAAAGCCACCTTGTTCTTG 13 8 
151 GAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGTGGGGGTCCGAGT 2 00 

II I llllllll IMIIIillllllll lllllll III III 

13 9 GAGGACAGGCTGCCCACAATCAACCTGGATGGTGTGGTGGGCTTCCAAGT 188 
2 01 GCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCC 2 50 

llllllll III II II III III II Mill II mil 

18 9 GCTGGAAGTGCAACTCCGAGGAGTTCAGGAAAAATGGGCTCACAAGCCCT 2 38 

2 51 TGCTGCAACCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAG 3 00 

lllllll II II III llllll II I III II III 

239 TGCTGCAGCCTCTCAGCATGCGCGCTGGACAGATGGCCAACACACTGTCT 2 88 

3 01 GCTGCCATCCAGAGATCCCTCCACTACCTCAAGCTGAGTGATCCCAAGTA 350 

III I Mil I MM II IIIIMIIIII IMIII MM Ml 

28 9 GCTCTCCTCCAAAAATCCATCTTCTACCTCAAGCAGAGTGACCCCACGTA 33 8 

3 51 CCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGGGTTTTGGAAGCTCCCAC 4 00 

lllllllllllllllil i I I Mill IIMMIIMII I II 

335 CCTAAGAGAGTTCCAGCOU^GCATTCAGCCTGGGTTTTGGAAGTTGCCCA 38 8 

4 01 ATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGCCC 4 50 

III Mill I III IIIIIMI I II llllll III III 
389 ATGACTGGACACGCACCAATGCCTCCCTAGTCTACCCCTGGCTGGAACCC 438 

4 51 CAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCT 500 
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I llllll lllllllllll II II M llllllllllllll II M 

439 CTGGACTCTTTCTCAGAGGAAAGCAGCGATGTGTGCCTGGTGCAACTACT 4 88 
501 GGGAACCGGGACGGACAG(^GCGAGCCCTGCGGCCTCTCAGACCTCTGCA 550 

iiiii Mill iiiiiiiii nil III I Mill II Mini 

4 89 AGgUcV^GGGACAGACAGCAGCCAGCCTTGCAGGCTCTCCAACTTCTGCA 538 
551 GGAGCCTCATGACCAAGCCCGGCTGCTCAGGCTACTGCCTGTCCCACCAA 600 

I I III IIIIIIIII IIIIIIIIIMIIMII MM IIIII II 

539 GAACC<:TTATGACCAAGGCCGGCTGCTCAGGCTACAGCCTCTCCC^^ 588 
6 01 cTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGGTGCACACAGGG ACCACT 6 50 

MlllllllllllllllillllMMI IIMIIM MUM I 

58 9 CTGCTCTTCTTCCTCTC ^38 

6 51 CCAACAGAGCCRGGACTATATCAACCTCTTCTGCGCCAACATGATGGACT 7 00 

M IMMIII MM II II IIIMII Mill IIMIIM_ 

639 CCTCITAGAGCCAACACTACATGGACATCTTCTGTGCCAATATGATGGAAC 6 88 

7 01 TGAACCGCAGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATC 7 50 

MUM Mlllllllllll I llllllll IIIII MM III II 

689 TcUccy^CyiGAGCTGAGGCCGTTG^ 738 
751 TTCATGGAAAACATCATGTTCTGTGGAATGGGCGGCTTCTCCGACTTCTA 8 00 

MIMMMMMI IIIMIIIIII Mil llllllll IIIIMII 

73 9 tTCATGgUaACATTATGTTCTGTGGTATGGCTGGCTTCTCTGACTTCTA 7 88 

8 01 cAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGAAACAGCAGGAAG 8 50 

M 1 1 M II 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I MM I 

7 89 cjJ^GCTGCGCTGGCTGGAGGCCATTCT 8 38 

8 51 GATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATT 9 00 

MIMMIIM Mini I I I I Ml Ml HI II 

8 39 GATGCTTCGGGAGGCCTGACAC . . . AAAGGGTGAACCTTCTGAAG . . . TT 8 82 
901 CAATATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATT 950 

I I llllllll II Ml llllll Mill MMIII II 

8 83 CCACATCAGCAGGGCATTCTGAGAAGAGTGCGAAGGCGGGAAAAACTGTT 9 32 

933 CGCAGATGGCTGTTCGTGCCACAACACAGCCACAGCAGTCGC^^ 98 2 

1001 GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCA 105 0 

1051 CACCCATCCACACCGCCACCACCAAGCAGCCGC 1083 

1033 cJvTCCA.GAATACTACCCAAACCAtLaLtCCATACTCATCCTC^ 1081 
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GAP of: FrGcgManager_6e7_IBFGliaq_ check: 8297 from: 1 to: 361 
hT300 prot (analysis only) - Import - complete 
to: FrGcgManager_68 7_JBFn\T7mm_ check: 9127 from: 1 to: 385 
mT300 prot (analysis only) - Import - complete 



Gap Weight: 12 

Length Weight: 4 

Quality: 1237 

Ratio: 3.427 

Percent Similarity: 74.930 



Average Match: 2.778 

Average Mismatch: -2.248 

Length: 391 

Gaps : 4 

Percent Identity: 69.577 



1 MASLGLLLLLLLTALPPLWSSSLPGLDTAESKATIADLILSALERATVFL 5 0 

II IIIIMIII nil • II- llh l-MII-llhlMI 

/^aAS£ 1 MMU^LLLLLLL.ALPPHF.SSVSWPDTAQ. .GTMANLILTALEKATLFL 4 6 
51 EORLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRVGMLGEKLE 100 

I 111 IIIIMI -III Ih Mill -lllllll'-l I I 

4 7 EDRLPTINLDGWGFQVLEVQLRGVQEKWAHKPLLQPLSMRAGQMANTLS 96 

101 AAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGP 150 

I :|:|: ||| III llllll •HIIIIIII- I Mlllll I ^ 
97 ALLQKSIFYLKQSDPTYLREFQPSIQPGFWKLPKDWTRTNASLVYPWLEP 14 6 

151 QDSFSEERSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQ 200 

MUM lllllllllllllllhll il- ll-MII IIMI MM 

14 7 LDSFSEESSDVCLVQLLGTGTDSSQPOT 196 
201 LLFFLWARMRGCT(:K5PI^SQDYim.FCANMMDUTRRA^ 2 50 

I I I M II 1 1 • M h I Ml I . • M M 1 1 h II III h M M M • M- ^ 

15 7 LLFFLWAR>K}<3CTEGLFLQS^ 246 
251 FMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFGEPDAEDEELSKAI 3 00 

247 UUiMfULUsdWk^ 294 
3 01 OYCK)HFSRRVKRREKQFPDGCSSHNTATAVAALGGFLYlLAEYPPANREP 3 50 

:|| llhMII I MM MIIIIMIIMMIIMM I I 

295 PHQC^ILJ^VRRREKLFAIXSCSCHNTATAVAALGGFLYIU^ 344 

351 HPSTPPPPSSR 

34 5 HPEYYPNHGDPYSSSQSPASNYQDGAAGPDVQRTGRPLSVS 385 
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:r.puc file T245Alhbabi55e5 : Cucpuc rile T245Aihbabi65e5 . pac 
Sequence lengch 1356 



GTCGACCCACGCGTCCGGCAGCCTCCAGCCCGCAGCCCGCAGCCCGGAGCCAGATCGCGGGCTCAGACC.\AACCCGACT ^'J 

CGAGCGCCGCCCCCAGCCAGGCGCC ATG CTG CCG CTT CTG CTG GGC CTG CTG GGC CCA GCG GCC ^^'^ 

CWALG?T?G?G5 S ElRiArS 
TGC TGG GCC CTG GGC CCG ACC CCC GGC CCG GGA TCC TCT GAG CTG CGC TCG GCC TTC TCG W 

AARTTPLEGTSEMAVTfDiCV 
GCG GCA CGC ACC ACC CCC CTG GAG GGC ACG TCG GAG ATG GCG GTG ACC TTC GAC AAG GTG 2^3 

YVMIGGDFDVATGQrRCRVp 
TAG GTG AAC ATC GGG GGC GAC TTC GAT GTG GCC ACC GGC CAG TTT CGC TGC CGC GTG CCC 323 

GAVFFS-FTAGKAPHKSLSVM 
GGC GCC TAG TTC TTC TCC TTC ACG OCT GGC AAG GCC CCG CAC AAG AGC CTG TCG GTG ATG iV3 

L V R N R D £ V 0 A A ? D E Q P. R P q 

CTG GTG CGA AAC CGC GAC GAG GTG CAG GCG CTG GCC TTC GAC GAG CAG CGG CGG CCA GGC H^S 

ARRAASQSAMLQL DYGDTVW 
GGG CGG CGC GCA GCC AGC CAG AGC GCC ATG CTG CAG CTG GAC TAC GGC GAC ACA GTG TGG 

L R L H G A ? Q ALGA P G A T F S G 

CTG CGG CTG CAT GGC GCC CCG CAG TAC GCG CTA GGC GCG CCC GGC GCC ACC TTC AGC GGC ^<.3> 

YLjVYADADADA?ARG??APP 
TAC CTA GTC TAC GCC GAC GCC GAC GCT GAC GCG CCT GCG CGC GGG CCG CCC GCG CCC CCC 

EPRSAFSAARTRS LVGSDAG 
GAG CCG CGC TCG GCC TTC TCG GCG GCG CGC ACG CGC AGC TTG GTG GGC TCG GAC GCT GGC ^i^> 

PGPRHQ ? LAFDTE FVNIGGD 
CCC GGG CCG CGG CAC C.^A CCA CTC GCC TTC GAC ACC GAG TTC GTC AAC ATT GGC GGC GAC 7^1 

F D A A A G V 7 a C R 1 P GAY ? ? S F 
TTC GAC GCG GCG GCC GGC GTG TTC CGC TGC CGT CTG CCC GGC GCC TAC TTC TTC TCC TTC 

T G K L ? R :< T L S V K L :\ K 11 K D E 

ACG CTG GGC AAG CTG CCG CGT AAG ACG CTG TCG GTT AAG CTG ATG AAG AAC CGC GAC GAG^*-^ 

V-QAMIYDDGASRRREMQSQS 
GTG CAG GCG ATG ATT TAC GAC GAG GGC GCG TCG CGG CGC CGC GAG ATG CAG AGC CAG AGCSZ^ 

VMLALRRGDAVWL LSHDHDG 
GTG ATG CTG GCC CTG CGG CGC GGC GAC GCC GTC TGG CTG CTC AGC CAC GAC CAC GAC QGC%1> 

YG AYSNKDL PTDL KTVLPSW 
TAC GGC GCC TAC AGC AAC CAC GAT CTC CCA ACT GAC CTC AAA ACG GTT TTG CCG AGT TGG"^^ 

DVHCCQVNQRFELCIGVIPE ^ 
GAC GTC CAC TGC TGT CAA GTC AAC CAG AGA TTT GAA CTG TGC ATT GGT GTG ATC CCT GAG"03 

ESQHWDDA I RMDTDL* 
GAA AGT CAG CAC TGG GAT GAC CCC ATC AGG ATG GAT ACA GAC CTC TAA |/ ^ | 



47/6(1 



WO0l/(M»672 




PCT/L!S(»0/I8184 



CTCATTGAAGCAGGACACCTGCACACATG?lAAGTGAGGGGAGKGGGAACAAAGAGC^^ I23o 
GCTGGAGTCACCrrGGTrrAATGCTGAGAGkAAAGTCCAAGCTTGGGA^ j-v^^cj 

TGCCCACTCCCACAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 13 5 6 
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if ff'T" 



'f\i \ 1 ^':.J ^ \ 



81. 



! ' I M ■ 1 



121 



I ' 1 ' 



i ■ I 



161 ^^-01 241 281 321 
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iequencG ■.•angch 1388 

:TCCACC>\:.CGCGTCCGGGCTCCCTCCCTGGCTCACCrrCCCTrrCCCTCACGCCTCCCCACGCCCGGCC7CGGGCCCC H 

AGCACCCTGTCCGCCGCCGCCTCAGAGCCGGGAGA.-.GCAGCCGGAGCCCCCGGCGCCCCCCCCCCAGCGCGGGCGGTCA .Sy 

GTGCGCAGCCCGGCGCCCGCAGCCCGCAGCCCGC.-.GCCGGATCGCGGGCTCGGACCGAACCCGACCCGACCGCCGCCCC 237 

M L ?:-■-- 2 L L G ? A A C .( A 
CAGCCAGGCTCC ATG CTG CCG CTT CTG C7A GGC C~G CTG GGC CCA GCG GCC TX:C TGG GCC 2-^7 

. J, p pG ?C£3£LR 5 Ar SAA ^ 

CTG GGC CCG GCC CCC GGC CCG GGC TCC TCA GAG CTG CGG TCG GCC TTC TCG GCG GCA CGC 2S 7 

T T P L E G A S E A V T r 0 K V V V N 

ACC ACT CCC CTG GAG GGC. GCG TCG GAG ATG GCG GTG ACC TTC GAC AAG GTG TAC GTG AAC 

r G G D f' D A A T G Q F R C R . V P G A Y 
ATC GGG GGC GAC TTC GAC GCG GCC ACA GGC C.AA TTC CGC TGC CGC GTG CCC GGC GCC TAC ^11 

- V G .1 ? H K 3 1 S V M V R 

TTC TCC TTC ACG GTT GGC f-J^G GCC CCG CAC AAG AGC CTG TCG GTG ATG CTG GTG CGG ^ W 

M H D E V Q A L A F 3 E Q R R P 5 A R R 
AAC CAC GAC GAG GTG CAG GCG CTG GCC TTC GAC GAG CAG AGG CGG CCC AGC GCA CGG CGC S'7 

^ S Q S A M L Q D Y G D T V W L R L 

GCC GCC AGC CAG AGC GCC ATC CTG CAG CTC GAC TAC GGC GAC ACA GTT TGG CTG CGG CTG 

H G A P Q V A L G A ? G A T F S G r L V 
CAT GGC GCC CCG CAG TAC GCG CTG GGC GCG CCC GGC GCC ACC TTC AGC GGC TAC CTG GTC ^'7 

V A D A D A D A ? A R G ? P A ? ? Z ? R 



tIc GCC GAC GCC GAC GCC GAC GCG CCT CCG CGC GGG CCG CCC GCG CCC CCC GAG CCG CGC ^77 

T?G g5a TTC T?G GCG GCG CGC ACG cSc ..^^ CrC GTC GGC TCG GAC GCG GGG TCC GGG CCG V37 

CGG clc CGG CCG CTA G^C TTC GAC ACC GAG CTC GTC A^C A-^ GGC GGC GAC TTC GAC GCG &<57 

VFRCRLPGAYFFSFTLG 
GCG GCC G^ GTG TTC CGC TOC CGC CTG CCC GGC GCC TAC TTC TTC TCC TTC ACG CTG GGC ,S7 



flo CTG C^ CGT AAG ACG CTG TCG GTG AAG CTG ATG AAG AAC CGC GAC GAG GTG CAG GCC i 0*7 

A?G Air tIc GAC GAC GGC G^G T^G CGG CGC CGC GAG A?G CAG A^ cJg AGC GTC ATG CTG (077 

RRGDAVWL'-.SHDHDGYGA 
GCC CTG CGG CGC GGC GAC GCC GTC TGG CTG CTC AGC CAC GAC CAC GAC GGC TAC GGC GCC 3/ 

tIc AGC A^C CAC G^ aIg TAC ATC ACC T^C TCC GGC T^C T^G GTC tIc CCC GAC CTC G^C 1^,7 
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;CC zee GCC CCG CCA GGC CTC GGG GCC . "--A C .o C :^ .o^ , 
jCCCCGGGCCAGAGAGGTGCCCGGGAGGGCO.GGCCCaTGCPvTGCCCGGCCAGGCCCCCGGCCCGAACCCCTCGCGCGA 
GCGCCACCCCCGCCGGGCGCGCCTGGACTCTGCCAATP.AAGCAGAAAGCGGGCCTGCGCATTGCCCGGGC 'ZW 
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'•'~*ZOU calculACes ^ ^ 
versLon 2.0uPiease 
- hT245 a. a. 
'• monkT24 5 a. a. 
:5coririg macrix: r:ami 
r14,B% idencicy; 



PCT/USOO/18184 



-cat ^.^aoment: zz -v.o sequences 
-e: Mvers andMiiier. C.\BI0S :989) 

34B aa 
329 aa 

:C.nac, Tip penalties: -12/-** 

3lobal aiignmenc ^core: i;2I 



HUMAN 



10 :o 30 40 50 60 70 

MLPLLL^LLG?AACV;AUiPTPGPGSSELRSAFSAARTTPLEGTSE>IAVTFDKVYVNIGGOFDVATGQFR^ 



MLPLLLGLU:?AACV; 
10 



MX;?APGPGSSELRSAFSAARTTPLEGASEMAVTrDK\rrVNrCGDrDAATGQFRC 
20 30 40 SO 60 70 



go 90 100 110 120 130 140 

RVPGAYFFSFT'AGKJ;PHKSLSV>aVTlNRDEVQAU\i^DEQRRPGAJlRAASQSAKLOLDYGDT'v^^^ 

RVPGAYFFSFTVGKAPHKSLSVML\mNKDE^^QMJ^DEQRHPSARR^ 

80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

QYAl^APGATFSGVLVYADADADAPARGPPAPPEPRSAFSAARTRSLVGSDAGPGPRKQPLAFDTEFVNI 

OYALGAPGATFSGVLVYA^ 

150 160 170 180 190 200 210 

220 230 240 250 260 270 280 

GGDFDAAAGVFRCRLPGAYFFSFTLGKLPRKTLSVKUiKNRDEVQAMlYDDGASRRREM 

r^nrnAAAC^RCRLPGAYFFSFTLGKLPRKTLSVK^^ 

220 230 240 250 260 270 280 

290 300 310 320 330 340 

GDAVWI^SKDHIX5YGAYSNHDLPTDtJ;TVLPSWDV>iCCgVNQRFELCrGVlPEESQKWDDAIRK^ 



GOAVWLLSKDHDG YGXY SNHG - 
290 300 



-iori- 



_TFSG FLVYPDLAPAAPP GLGAPELL 

310 320 
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GyysO->5t^ '-^ArcvirscnrpPaEmsnpgqpViFdeVLyNqqghYdpaTGkFtCkvP 
AF-^- rfC pf* ^ V Fd+V +N^-+g+ + d' aTG F C vP 

n -?'5AARTT--?LE--GTSniAVTrDICVYVNIGGDFDVATGQFRCRVP 73 

00^x9 ^l-YyFsFhvsskgcRqnvcVsLiiirSSrngvrqkiVraetcdeyakgcyqvaS 
_ G Y fsF ^ - -.^^ V L^-r r+ e g + +aS 

/4 GAYF^S^TAGKAP-HKSLSV^fLVT^-^mDE:VQ--ALAFDEQRRPGARRAAS 119 

C:^>0$e>^5</J GGavLcLrqGDrWLelddkqnngilggegvhSvFSGFLl< - - 
-a LqL GD VWL I > ^ 1 +^FSG + L + 

120 QSAMLQLDYGDTVWLRLHGAPQYALGAPG ATFSGYLV 156 



Cc?t0^ek^ • ->AFtvirscnrpPaExnsn pgqpViFde\ALyNqqghYdpaTGkF 

AF + ^ r+ i- s-K + + +++ qp Fd+ +N++g + +d"a G^-F 
178 AFSAARTR-SLVG--SDagpgpRHQPLAFDTEFVNrGGDFDAAAGVF 221 

C£3U5<4i-^ ^^^vPC^'-^yf'sFhvsskgcRqnvcVsLnirSSrncrvrqkVmef cdeyakgn 
C i-PG Y FsF - + * V Lm+ r^-'v+ -d ' a 

222 RCRLPGAYFFSFTLGKX?-RKTLSVKIJlK-brRDEVQ--.\WI--fDDGASRR 256 

CoOtii^^t-^ yqvabGGa vLqLrqGDrVWLeiddkqcngl<- - 

S L-rLr+GD VWL i- d +g+ 

267 REJlQSQSVMLALRRGDAVVrLLSHDH--DGY *' 294 
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' ->AFtvirscnrpPa£.-nsnDaqpN/LFdeVLyNqqghYdpaTGkFtCkvP 
AF + ^ r^c o** ^ V Fd+V ^'N+-+g^-+d aTG F C vP 

3 : AFSAARTT--?LE--GASEMAVTFDKVYVNIGGDrDAATGQFRCRVP 7 3 

GlYyFsFhvsskgcRqnvcVsLmrSSrngvrqkVmefcdeyakgcyqvaS 

Y FsF V ^ V L+r v+ f + +aS 

74 GAYFFSFTVGKAP-HKSLSV-MLVR-NHDEVQ--ALAFDEQRRPSARRAAS 1L9 

GGavLqLrqGDrVWLeiddkqtngilggegvhSvFSGFLl<-* 
LqL GD VWL i ^ ^ 1 *^ ^- + FSG + L + 

120 QSAMLQLDYGDr^/;LRLHGAPQYALGAPG---ATFSGYLV 156 



ClUOSC^u^ .->AFLvirscnrpPaErusn pgqpViFdeVLyNqqghYdpaTGkF 

AF++ r^- ^ s+ + + ^-t-.' p Fd-»" L+N++g++d a G+F 

L7 8 AFSAARTK-SLVG--5DagsgpRHRPLAFDTELVNIGGDFDAAAGVF 221 

r^^^^oj/x ^^ckvPGlYyFsFhvsskgcRqnvcVsUnrSSrngvrqkVmefcdeyakgc 

CC/SJS^JtO ^ ^ ^ ^^^^^ ^ ^ 

222 rcRLPGAYFFSFTLG>XP-RKTLSVKUIK-KRDE:VQ--AMI-YDDGASRR 266 

^ K ycTvaSGGavLqLrqGDrVVn:.elddkqt:ngilggegvhSvFSGFLl<- * 

OJ^5£A^t/= S L.Lr^GD VWL . d --g g + -^**FSGFL+ 

2 67 REMQSQSVMIALRRGDAWn.LSHDHXX;YGAYSrJHGKYITFSGFLV 311 
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tcggaccgcc cgccaccagc cacgtgcc atg ctg ctg etc ttg ctg ggc etc 52 

Met Leu Leu Leu Leu Leu Gly Phe 
1 5 

eta ggc ccg gcg gee tgc tgg gca ctg ggc ccg get ggc cct ggc tec 100 

Leu Gly Pro Ala Ala Cys Trp Ala Leu Gly Pro Ala Gly Pro Gly Ser 

10 15 20 

teg gag ctg egg tea gcc ttc teg gcg get cgc acc ace ccg ctg gag 14 8 

Ser Glu Leu Arg Ser Ala Phe Ser Ala Ala Ary Thr Thr Pro Leu Glu 

25 30 35 40 

ggc acg teg gag atg gcg gtg acc ttc gac aag gtg tac gtg aac ate 196 

Gly Thr Ser Glu Met Ala Val Thr Phe Asp Lys Val Tyr Val Asn lie 

45 50 55 

999 99^ ttc gac gca gcc acc ggg egg ttc cgc tgt cgc gtg ccg 244 

Gly Gly Asp Phe Asp Ala Ala Thr Gly Arg Phe Arg Cys Arg Val Pro 

60 65 70 

ggc gcc tac ttc ttc tec ttc acg gcc ggc aag gcc ccg cac aaa aac 292 

Gly Ala Tyr Phe Phe Ser Phe Thr A}a Gly Lys Ala Pro Hin Lys Asn 
75 80 85 

ctg teg gtg atg ctg gtg cgc aac cgc gac gaa gtg cag gcg ctg gcL 340 

Leu Ser Val Met Leu Val Arg Asn Arg Asp Glu Val Gin Ala Leu Ala 

90 95 100 

ttc gac aag cag cga egg cea ggc gcg egg cgc gcg gcc age caa age 38G 

Phe Asp Lys Gin Arg Arg Pro Gly Ala Arg Arg Ala Ala Ser Gin Ser 

105 110 115 120 

gee atg ctg cag etc gac tac ggc gac acg gtg tgg ctg egg ctg cac 436 

Ala Met Leu Gin Leu Asp Tyr Gly Asp Thr Val Trp Leu Arg Leu His 

125 130 135 

ggc get ccg cat tac gcg etc ggc gcg ccg ggc gcc acc ttc age ggc 484 

Gly Ala Pro His Tyr Ala Leu Gly Ala Pro Gly Ala Thr Phe Ser Gly 

140 145 150 

tac ctg gtg tac gcg gac gcc gac gee gac gcg cct gcg cgc ggg cec 532 

Tyr Leu Val Tyr Ala Asp Ala Asp Ala Asp Ala Pro Ala Arg Gly Pro 
155 160 165 

gcg gcc ccg gag ccg cgc teg gcc ttc tec gcg cgc cac gcc acc tgg 580 

Ala Ala Pro Glu Pro Arg Ser Ala Phe Ser Ala Arg His Ala Thr Trp 

170 175 180 

tgg get ccg aac ccg cec egg ecc gcg eca egg cgt ttg gcc ttc 625 

Trp Ala Pro Asn Pro Pro Arg Pro Ala Pro Arg Arg Leu Ala Phe 

185 190 195 
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GAP of: rrGcgManager_266_RYBH0yQG_ check: 1120 from: 1 to: 625 
mM245 (analysis only) - Import - complete 

to: FrGcgManager_266_SYB96sGK_ check: 9121 from: 1 to: 27^7 
hM245 (analysis only) - Import - complete 



Gap We iqh t : 
Length Weight: 



12 Average Match: 10.000 

'5 Average Mismatch: 0.000 



Quality; 540^] Length 
Ratio: 8.646 Gaps 
Percent S imi la r i t y : , 8 9 . 5 67 Percent Identity 



2 74 9 
10 

89 . 567 



Match display thresholds for the alignment (s) 

I - IDENTITY 

: = 5 

. = 1 



PrGcqManager_266 RYBHOyQG_ x FrGcgManager 266 SYB96sGK May 30 19100 
15:01.. ~ - .._ . 



^OOOUiC 1 TCGGACCGCCCGCCACCAGCCA 2 2 

, I I ! I I I I I I f I I M I I M I 

htN^n^n 51 CAGATCGCGGGCTCAGACCAAACCCGACTC.GACCG.CCGCCCCCAGCCA 98 

2 3 CGTGCCATGCTGCTGCTCTTGCTGGGCTTCCTAGGCCCGGCGGCCTGCTG 72 

I I I I I I I M I I III I M I M M I M I I t I I M II I M II M 
99 GGCGCCATGCTGCCGCTTCTGCTGGGCCTGCTGGGCGCAGCGGCCTGCTG 14 8 

7 3 GGCACTGGGCCCG. . .GCTGGCCCTGGCTCCTCGGAGCTGCGGTCAGCCT 119 
IN M I I I M I I I Mill II Mill M I II M I II MM 

14 9 GGCCCTGGGCCCGACCCCCGGCCCGGGATCCTCTGAGCTGCGCTCGGCCT 198 

120 TCTCGGCGGCTCGCACCACCCCGCTGGAGGGCACGTCGGAGATGGCGGTG 169 

I I M I M I II II I II M II I I I M M II I M M M M M M M M M I 
199 TCTCGGCGGCACGCACCACCCCCCTGGAGGGCACGTCGGAGATGGCGGTG 24 8 

170 ACCTTCGACAAGGTGTACGTGAACATCGGGGGTGACTTCGACGCAGCCAC 219 

1 M M I I I M II I M II I I M I M I M II I M II M II I I I I II I I 
24 9 ACCTTCGACAAGGTGTACGTGAACATCGGGGGCGACTTCGATGTGGCCAC 2 98 

220 CGGGCGGTTCCGCTGTCGCGTGCCGGGCGCCTACTTCTTCTCCTTCACGG 269 

IN I Ml Mill II II I II I t I I M M I II I M II I I M I I M M 
299 CGGCCAGTTTCGCTGCCGCGTGCCCGGCGCCTACTTCTTCTCCTTCACGG 34 8 

27 0 CCGGCAAGGCCCCGCACAAAAACCTGTCGGTGATGCTGGTGCGCAACtGC 319 
I I II M II M M I M ! M i I M I I II M II M M M II M I M M I 

34 9 CTGGCAAGGCCCCGCACAAGAGCCTGTCGGTGATGCTGGTGCGAAACCGC 398 

32 0 GACGAAGTGCAGGCGCTGGCTTTCGACAAGCAGCGACGGCCAGGCGCGCG 369 
Mill I II II M II M II I M 1 M I M M I II M II M M II M 11 

3 99 GACGAGGTGCAGGCGCTGGCCTTCGACGAGCAGCGGCGGCCAGGCGCGCG 448 
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KVJOUSC 37 0 GCGCGCGGCCAGCCAAAGCGCCATGCTGCAGCTCGACTACGGCGACACGG 419 

U.,^;:»i^ H I i I I I I I I I I I I I t I t I I I I I I I I I I I I M I I I I I I ! M I 

Kvurric?»n GCGCGCAGCCAGCCAGAGCGCCATGCTGCAGCTCGACTACGGCGACACAG 4 98 

4 20 TGTGGCTGCGGCTGCACGGCGCTCCGCATTACGCGCTCGGCGCGCCGGGC 4 69 

I I I I I 1! I I I I I I I 1 I Mill Mill I I I I I I I I I I I I I I I I Ml 

4 99 TGTGGCTGCGGCTGCATGGCGCCCCGCAGTACGCGCTAGGCGCGCCCGGC 54 8 

4 7 0 GCCACCTTCAGCGGCTACCTGGTGTACGCGGACGCCGACGCCGACGCGCC 519 

M M M M I M M M I I M I M Mill II M I M I M I M M M II 

54 9 GCCACCTTCAGCGGCTACCTAGTCTACGCCGACGCCGACGCTGACGCGCC 598 

520 TGCGCGCGG. . .GCCCGCGGCCCCGGAGCCGCGCTCGGCCTTCTCCGC-G 565 

M M II M I M I M M MM I I I M I M M M M M I M I M I 

599 TGCGCGCGGGCCGCCCGCGCCCCCCGAGCCGCGCTCGGCCTTCTCGGCGG 64 8 

5 66 CGC . CAC^C . G^ . CCTGGTGGGCTCCGA . ACCCGCCCCGGCCCGCGCC A . 610 

I Tl MM I II I 11 M M M I I II I M II II I M M I II 

64 9 CGCGCACGCGCAGCTTGGTGGGCTCGGACGCTGGCCCCGGGCCGCGGCAC 6 98 

611 CGGCGTTTGGCCTTC 625 

I I I M M I I 

699 CAACCACTCGCCTTCGACACCGAGTTCGTCAACATTGGCGGCGACTTCGA 74 8 
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10 20 30 40 SO 

MAA-QYGSMSFNPS - - -TPQASYGPG- - - -RQEPRNSQLR- - - - IVLVGKrOAQKSATGNSILGRKVFHS 

MANDQKNSESFPAKEDHKKDDAAAPAE^raHKDEFSA^ 

10 20 30 40 SO 60 70 

70 80 90 100 110 120 

GTAAKSITKKCEKRSSSWKETELV-VVDTPGIFDTEVPNAETSKEIIRCILLTSPGPHALLLVVPL-GRy 

K^^CSSGVT^IECHAVKAVTPEGPIUn^IDTPGLFDLSVSAEFIGKEIVKCLT^^ 

aO 90 100 110 120 130 140 

130 140 150 160 170 180 190 

TEEEHKATEKILKMFGERARSFMILIFTRKDDLGDTN-IJnDYIJlEAPEDIQDIJffilF-GDRYCA^ 

SOEEEMVLSTLQVLFGSKIVDYLIVVFTGGDVL^ 

150 - 160 170 180 190 200 210 

200 210 220 230 240 250 260 

TGAEQEAQRAQ - LLGLIQR^A^^ENKEGCYT^^U«QRAEEEIQKQTQAMQEIJI-RVE1^KARIR^^ 

KDDEKKTKQVHEIA-KLIDLVRKQNNNIPrroEKYHHIKEEKERHK^ 

220 230 240 250 260 270 280 

270 280 290 300 310 320 

KI - Rja.EDKVEQ - EKRKK- Q^^E - - KXIJ^QEAHYAWQQRARTEVESKDGILELIOTAI^IASFII^ 

MNERNiKA^tAEMMEm^KIRMEAQElCLFEQREKAQEKSyQ<5KME^WEK^ 

290 300 310 320 330 340 350 



AED 
SIL 
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ALIGN calculAces ^^iobal iiignmenc o: cwo . . ^ 

version 2.0uPlease cite: Myers and MiUer. CABIOS <;;9B9) 
-y^-^-fc - . 12S3 aa vs. 

- rTiT275 a. a. 

> Pacenc Protein R79475 - uaciciedl ^^^^ 

scoring matrix: painl20 .mac. gap penalties: 

97.4% identity; Global alignment score: 6/^0, 

10 20 30 40 50 60 

inputs MRQAGGU;LtJU.LLIJaLGPGGRGVGRPGSGAQAGAGRWAQRFKVVrAPVICKRTC 

mrqaa-ix^lialLll^ll^^pggr^^^ 

10 20 iO 40 50 

70 80 90 100 110 120 

inputs CRDSCQQGSNMTLrGE^GHSTDTLTGSAFR\nA/CPLPCMNGG<KSSRNQCL^ 

crdscqqgsnmtligenghstdtltgsafrvwcplpchng^ 

60 70 80 90 100 110 

130 140 150 160 170 180 

inputs cqvtaagtgagtgssgpgwpdrakstgplpplapegesvaskhaiyavqviadppgpgeg 

CCWT AAGT^GTG S SG ^ 
120 130 140 150 160 170 

190 200 210 220 230 240 

inputs PPAQHAAFLVPLGPGQISAEVQAPPPVV^^/RVHHPPEASVQVHRIEGPN^£GPASSQH^ 

PPAQHwixVPLGP^ 
180 190 200 210 220 230 

250 260 270 2B0 290 300 

inputs PHPKPPHPRPPTQKPLGRCFQDTLPKQPCGSNPLPGLTKQEDCCGSIGTAW(^SK^ 

potkpqhprpp^qkplgrcfo^ 

240 250 260 270 280 290 

310 320 330 340 350 360 

inputs Qt^YTCVQKFVPVRGEVGADCPQGyKRr^STHCQDIKECAMPGKVC^^ 

300 310 320 330 340 350 

370 380 390 400 410 420 

inputs CPPGKSLGPU^QCIADKPEEKSI^FRLVSTEHQCQHPLTTRLTRQLCCCSVGKA^ 

CPrcHSI^PLiAQCIADKPEE^ 
360 370 380 390 400 410 

430 440 450 460 470 480 

inputs QRCPAIXrrAArKEICPGWERVPYPHLPPDAHHPGGKRLLPLPAPDGPPK^ 

QRCPADCTAAFT^ 
420 430 440 450 460 470 

490 500 510 520 530 540 

inputs PPLEOTEEE31GVTMDPFVSEIUISVQQSHPTTTTSPPRP 

ppi^OT^^ 

480 490 500 510 520 530 

550 560 570 580 590 600 

inputs SRSAVEIAPTXJVTETDECRU^QNICGHGQCVPGPSDYSCHCHAGYRSH^ 

SRSAVEIAmiWCT 
540 S50 560 570 580 590 

610 620 630 -640 650 660 

inputs EAEPCGPGKGICM^^^GGSYNCHCNRGYRLHVGAGGRSCVDLNECAKPHLCGDGGFCrK^ 

EAEPCGPGKGlCt^^ 
600 610 620 630 640 650 
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670 680 690 700 710 720 

inputs GHYKCNCYPGYRLKASRPPICEDIDECRDPSTCPDGKCENKPGSFKCIACQPGYRSQGGG 

G^^i^ccNCYPGyRLIC^sRPP ^ 

660 670 680 690 700 710 

730 740 750 760 770 780 

inputs ACRDVNECSEGTPCSPGWCENLPGSYRCTCAQGIRTRTGRLSCIDVDECEAGKVCQDGIC 

ACRDVNECSECTPCSPGWC 
720 730 740 750 760 770 

790 800 810 820 830 840 

inputs TNTPGSFQCQCLSGYHLSRDRSRCEDIDECDFPAACIGGDCINTNGSYRCLCPLGHRLVG 

TtrrPGSFQCQCLSGTfHLSRDRSR^ 
780 790 800 810 820 830 

aSO 860 870 880 890 900 

inputs GRKCKKDIDECSQDPGI^CLPHACENI^SYVCVCDEGPIXTQDQHGCEEVEQPHHK^ 

G RKCBM IDEC^QD PG^ 
840 850 860 870 880 890 

910 920 930 940 950 960 

inputs LNFDDTVFCDSVlATNVTQCECCCSLGAGWGDHCEIYPCFVYSSAEFHSLVPrcF^^ 

900 910 920 930 940 950 

970 980 990 1000 1010 1020 

inputs QQHC£IX:lPAHRDIDEClLFGAEICKEGKCVm'QPGYECYCKQGFYYT>3NI^C^^ 

00 HciLC i P AHRD i D E C i 
960 970 980 990 1000 1010 

1030 104O 1050 1060 1070 

inputs LDESNCRNGVCEbmiGGYRCACTPPAEYSPAQAQCLIPERWS-TPQRDVKCA 

LX>ESKCRNGVcil^^ 
1020 1030 1040 1050 1060 1070 

1080 1090 1100 1110 1120 1130 

inputs CVWGPWAGPALTFDDCCCRQPRLGTQCRPCPPRGTGSQCPTSQSESNSFWDTSPLLLGKS 

CM-GPl^GPALTFDCKi:CCRQPR£^ 
1080 1090 1100 1110 1120 1130 

1140 1150 1160 1170 1180 1190 

inputs pRDEDSSEEDSDECRCVSGRCVTRPGGAVCECPGGFQrmSRARCVDIDEC^^ 

1140 1150 1160 1170 1180 1190 

1200 1210 1220 1230 1240 1250 

inputs CKSEROmrSGSFRCVCKAGrrRSRPHGPACLSAAADDAAIAHTSVIDHRGYFH 

1200 1210 1220 1230 1240 1250 
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SEQUENCE LISTING 



<110> Millennium Pharmaceuticals, Inc. 



<12 0> SECRETED PROTEINS AND USES THEREOF 



<130> 7853-205-228 



<140> 
<141> 



<150> 09/342,687 
<151> 1999-06-29 



<160> 93 



<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 1513 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (85) . . (570) 

<221> modi f ied_base 

<222> all "n" positions 

<223> n = a, c, g, or t 



<400> 1 

gtcgacccac gcgtccggcc cgacacccag accgaccttg accgcccacc tggcaggagc 60 

aggacaggac ggccggacgc ggcc atg gcc gag etc ccg ggg ccc ttt etc 111 

Met Ala Glu Leu Pro Gly Pro Phe Leu 
1 5 

tgc ggg gcc ctg eta ggc ttc ctg tgc ctg agt gtt ccc ccc agt aat 159 

Cys Gly Ala Leu Leu Gly Phe Leu Cys Leu Ser Val Pro Pro Ser Asn 

10 15 20 25 

ccc tta tgc agt cag agt gga caa acc tct gtg gga ggc tct act gca 207 

Pro Leu Cys Ser Gin Ser Gly Gin Thr Ser Val Gly Gly Ser Thr Ala 
30 35 40 

ctg aga tgc age tct tec gag ggg get ect aag cca gtg tac aac tgg 255 

Leu Arg Cys Ser Ser Ser Glu Gly Ala Pro Lys Pro Val Tyr Asn Trp 
45 50 55 

gtg cgt ctt gga act ttt ect aca ect tct ect ggc age atg gtt caa 303 

Val Arg Leu Gly Thr Phe Pro Thr Pro Ser Pro Gly Ser Met Val Gin 
60 65 70 

gat gag gtg tct ggc cag etc att etc acc aac etc tec ctg acc tec 351 

Asp Glu Val Ser Gly Gin Leu lie Leu Thr Asn Leu Ser Leu Thr Ser 
75 80 85 



1 



wo 01/00672 




PCT/IJS00/181S4 



teg ggc acc tac cgc tgt gtg gcc acc aac cag atg ggc agt gca tec 399 
Ser Gly Thr Tyr Arg Cys Val Ala Thr Asn Gin Met Gly Ser Ala Ser 
90 95 100 105 

tgt gag ctg acc etc tct gtg acc gaa ccc tee caa ggc cga gtg gcc 447 
Cys Glu Leu Thr Leu Ser Val Thr Glu Pro Ser Gin Gly Arg Val Ala 
110 115 120 

gga get ctg att ggg gtg etc ctg ggc gtg ctg ttg ctg tea gtt get 495 
Gly Ala Leu lie Gly Val Leu Leu Gly Val Leu Leu Leu Ser Val Ala 
125 130 135 

geg ttc tgc ctg gtc agg ttc cag aaa gag agg ggg aag aag ccc aag 543 
Ala Phe Cys Leu Val Arg Phe Gin Lys Glu Arg Gly Lys Lys Pro Lys 
140 145 150 

gag aca tat ggg ggt agt gac ctt egg tgageaggag ggctgggggg 590 
Glu Thr Tyr Gly Gly Ser Asp Leu Arg 
155 160 

tggegeaagg agggaggaaa gggcttgagt taaaagcggg tgcctgeaae cctcaaacte 650 

cgacatcatt cagtgtgttt aggggcagga ggtgttgttc agccgtggaa tttgetggtg 710 

gcagcagtgt aacctgtgta tttgagggta eaggeaageg gtacagggtg gagtggetgg 770 

tccacaaget gtggcaggga agctgtttge aggaetgece tgcccctcct catatttaat 830 

aaagtttact tttctgttcc gaaggtattt tcatatattt taaccacctg ggagtagtag 890 

tqgettgtag atgccaggaa atggatttgt cetgagcagt cagctgagtt eaattettet 950 

gtggaggaaa teaggaaagg ggaggggaaa etgeetetgt eatecaettt agctgecagn 1010 

eagggtetag gatagggatc agageaaeat ttettcaggt ggagtcctca gattaeetgg 1070 

aeagaaatea ccgggaacta gttataeatt cagattcagg ccacttctag ecttectgta 1130 

gttgtgcgtt ggggagtgat naggcceana aatttcnttt taaceaaagt tccncanatt 1190 

attttcaagc ccagtgaaat ttaagagtcc ccaggttaga ggacggeeet ecnccgcagg 1250 

aggnttttae tgkttactca gaacttgect ataceeatca gggaggatgc catcgctect 1310 

gggatctctg agcacacttg tatgagggct gattetagea aggggttcct ggaaagacce 1370 

tegtctgcca gcaccgtgac gaccaceaag tceaagctec ctanggtcgt gtgacttcte 1430 

eegatecctg agggcggtga gggggaatat caataattaa agtctgtggg taccaaaaaa 1490 

aaaaaaaaaa aaagggeggc cgc 1513 



<210> 2 

<211> 162 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Ala Glu Leu Pro Gly Pro Phe Leu Cys Gly Ala Leu Leu Gly Phe 

15 10 15 

Leu Cys Leu Ser Val Pro Pro Ser Asn Pro Leu Cys Ser Gin Ser Gly 

20 25 30 

Gin Thr Ser Val Gly Gly Ser Thr Ala Leu Arg Cys Ser Ser Ser Glu 

35 40 45 

Gly Ala Pro Lys Pro Val Tyr Asn Trp Val Arg Leu Gly Thr Phe Pro 

50 55 60 

Thr Pro Ser Pro Gly Ser Met Val Gin Asp Glu Val Ser Gly Gin Leu 
65 70 75 80 

lie Leu Thr Asn Leu Ser Leu Thr Ser Ser Gly Thr Tyr Arg Cys Val 

85 90 95 

Ala Thr Asn Gin Met Gly Ser Ala Ser Cys Glu Leu Thr Leu Ser Val 
100 105 110 
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Thr Glu Pro Ser Gin Gly Arg Val Ala Gly Ala Leu He Gly Val Leu 

115 120 125 

Leu Gly Val Leu Leu Leu Ser Val Ala Ala Phe Cys Leu Val Arg Phe 

130 135 140 

Gin Lys Glu Arg Gly Lys Lys Pro Lys Glu Thr Tyr Gly Gly Ser Asp 
145 150 155 160 

Leu Arg 



<210> 3 

<211> 486 

<212> DNA 

<213> Homo sapiens 



<400> 3 

atggccgagc 

gt tcccccca 

gcac tgagat 

ggaacttttc 

attctcacca 

^tgggcagtg 

gccggagctc 

ctggtcaggt 

c t tcgg 



tcccggggcc 
gtaatccctt 
gcagct c ttc 
ctacaccttc 
acctctccct 
catcctgtga 
tgattggggt 
tccagaaaga 



c t ttctctgc 
atgcagtcag 
cgagggggct 
tcctggcagc 
gacctcctcg 
gctgaccctc 
gctcctgggc 
gagggggaag 



ggggccctgc 
agtggacaaa 
cc taagccag 
atggttcaag 
ggcacctacc 
tctgtgaccg 
gtgctgttgc 
aagcccaagg 



taggc t tcct 
cct c tgtggg 
tgtacaactg 
atgaggtgtc 
gctgtgtggc 
aaccctccca 
tgtcagttgc 
agacatatgg 



g tgcctgagt 
aggctctact 
ggtgcgtctt 
tggccagctc 
caccaaccag 
aggccgagtg 
Cgcgttctgc 
gggtagtgac 



<210> 4 

<211> 1992 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
486 



c220> 
;221> CDS 



(94) 



(1080) 



<400> 4 

gtcgacccac gcgtccgcaa catcctggct tagtattgtg tgcaaaacca gagaggggtg 
caagatcctg atttttcagg agttcaagcg aca atg gca gcc caa tac ggc agt 

Met Ala Ala Gin Tyr Gly Ser 
1 5 



60 
114 



atg age ttc aac ccc age aca cca ggg gcc agt tat ggg cct gga agg 
Met Ser Phe Asn Pro Ser Thr Pro Gly Ala Ser Tyr Gly Pro Gly Arg 
10 15 20 



162 



caa gag ccc aga aat tec caa ttg aga att gtg tta gtg ggt aaa acc 
Gin Glu Pro Arg Asn Ser Gin Leu Arg He Val Leu Val Gly Lys Thr 
25 30 35 



210 



gga gca gga aaa agt gca aca gga aac age ate ctt ggc egg aaa gtg 
Gly Ala Gly Lys Ser Ala Thr Gly Asn Ser He Leu Gly Arg Lys Val 
40 45 50 55 



258 



ttt cat tct ggc act gca gca aaa tec att acc aag aag tgt gag aaa 
Phe His Ser Gly Thr Ala Ala Lys Ser He Thr Lys Lys Cys Glu Lys 
60 65 70 



306 
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cgc age age tea tgg aag gaa aca gaa ctt gtc gta gtt gac aca cca 354 
Arg Ser Ser Ser Trp Lys Glu Thr Glu Leu Val Val Val Asp Thr Pro 
75 80 85 

ggc att ttc gac aca gag gtg ccc aat get gaa acg tec aag gag att 402 
Gly He Phe Asp Thr Glu Val Pro Asn Ala Glu Thr Ser Lys Glu He 
90 95 100 

att cgc tgc att ctt ctg acc tec cca ggg cct cat get ctg ctt ctg 450 
He Arg Cys He Leu Leu Thr Ser Pro Gly Pro His Ala Leu Leu Leu 
105 110 115 

gtg gtt cca ctg ggc cgt tac act gag gaa gag cac aaa gcc aca gag 498 
Val Val Pro Leu Gly Arg Tyr Thr Glu Glu Glu His Lys Ala Thr Glu 
120 125 130 13b 

aag ate ctg aaa atg ttt gga gag agg get aga agt ttc atg att etc 546 
Lys He Leu Lys Met Phe Gly Glu Arg Ala Arg Ser Phe Met He Leu 
140 145 150 

ata ttc acc egg aaa gat gac tta ggt gac acc aat ttg cat gac tac 594 
He Phe Thr Arg Lys Asp Asp Leu Gly Asp Thr Asn Leu His Asp Tyr 
155 160 165 

tta agg gaa get cca gaa gac att eaa gac ttg atg gae att tte ggt 642 
Leu Arg Glu Ala Pro Glu Asp He Gin Asp Leu Met Asp He Phe Gly 
170 175 180 

gac cgc tac tgt gcg tta aac aac aag gea aca ggc get gag cag gag 690 
Asp Arg Tyr Cys Ala Leu Asn Asn Lys Ala Thr Gly Ala Glu Gin Glu 
185 190 195 

gcc eag agg gca cag ttg ctg ggc etg ate cag ege gtg gtg agg gag 738 
Ala Gin Arg Ala Gin Leu Leu Gly Leu He Gin Arg Val Val Arg Glu 
200 205 210 215 

aac aag gaa ggc tgc tac act aat agg atg tac caa agg gcg gag gag 786 
Asn Lys Glu Gly Cys Tyr Thr Asn Arg Met Tyr Gin Arg Ala Glu Glu 
220 225 230 

gag ate cag aag caa aca eaa gca atg caa gaa etc cac aga gtg gag 834 
Glu He Gin Lys Gin Thr Gin Ala Met Gin Glu Leu His Arg Val Glu 
235 240 245 

Ctg gag aga gag aaa geg egg ata aga gag gag tat gaa gag aaa ate 882 
Leu Glu Arg Glu Lys Ala Arg He Arg Glu Glu Tyr Glu Glu Lys He 
250 255 260 

aga aag ctg gaa gat aaa gtg gag cag gaa aag aga aag aag caa atg 930 
Arg Lys Leu Glu Asp Lys Val Glu Gin Glu Lys Arg Lys Lys Gin Met 
265 270 275 

gag aag aaa eta gca gaa cag gag get cac tat get gta agg eag caa 978 
Glu Lys Lys Leu Ala Glu Gin Glu Ala His Tyr Ala Val Arg Gin Gin 
280 285 290 295 
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agg gca aga acg gaa gtg gag agt aag gat ggg ata ctt gaa tta ate 
Arg Ala Arg Thr Glu Val Glu Ser Lys Asp Gly lie Leu Glu Leu He 
300 305 310 



1026 



atg aca gcg tta cag att get tec ttt att ttg tta cgt ctg ttc gcg 
Met Thr Ala Leu Gin He Ala Ser Phe He Leu Leu Arg Leu Phe Ala 
315 320 325 



1074 



gaa gat taaacttaat gaaaatctgt ttgtattttc tgcatattct ctggcaacct 
Glu Asp 



1130 



tgccccatac 
gtaactaagg 
aaattttcaa 
ct tccttaga 
t catgt tacc 
ccct tgtagc 
gtggtttttg 
atgat tgtag 
agtcaaattg 
ctctacaaat 
aacagataat 
CO tcaattta 
caacctcacc 
cttctgcccc 
a t c t caaaaa 



ttacttattt 
accaccattg 
tttgtgaaac 
catgcagaga 
tgtaataaaa 
ttccactcac 
agaaagaaag 
aattattcct 
tttcttatca 
tcaattaaat 
agagcaatgt 
accaat ttga 
tccctcactc 
cactgcattg 
aaaaaaaaaa 



agcatagtcg 
gccat tggta 
tccaaagcag 
aaatgtatgc 
tcttttcttc 
ttattcttgc 
atcaattctt 
agatgagtgt 
tttgttcatt 
aa tgatcccc 
ttagtatatt 
tgaataccca 
aacact t tga 
aattttttgc 
aa 



agtgctctag 
gatgtttgat 
aaagtattgg 
aagagaccaa 
tagattcttt 
at tcagagtc 
tgt ttgcagt 
caatttattt 
gaagaacaga 
aaatgctgaa 
cagctgtatc 
gttctcttct 
atact tattg 
ttatgttgtt 



tttctgtctc 
tgacttaaca 
tgcttgc tac 
aaagatggct 
ctatgttggc 
acaatga tea 
aggtaatctt 
aattccattg 
gacctgtctg 
aaagtgaaat 
tgtagaaact 
tttctagaga 
tttggcaggt 
tataataaaa 



teaggcactc 
sgagagggac 
et tgtgaatt 
ccaagcta tg 
agataatctc 
tct tacccat 
agagatggag 
tcatataagg 
gaaaatcgat 
acagcaattc 
ctt tgacgaa 
aagatagttg 
eat ceacaca 
ettttcaatt 



1190 
1250 
1310 
1370 
143 0 
1490 
1550 
1510 
1670 
1730 
1790 
1850 
1910 
1970 
1992 



<2:o> 5 

<211> 329 

<2 12> PRT 

<2 13> Homo sapiens 



<40G> 5 

Met Ala Ala Gin Tyr 
1 5 
Ala Ser Tyr Gly Pro 
20 

Tie Val Leu Val Gly 
35 

Ser He Leu Gly Arg 
50 

Tie Thr Lys Lys Cys 

65 

Leu Val Val Val Asp 
85 

Ala Glu Thr Ser Lys 
100 

Gly Pro His Ala Leu 
115 

Glu Glu His Lys Ala 
130 

Aia Arg Ser Phe Met 

145 

Asp Thr Asn Leu His 
165 

Asp Leu Met Asp He 
180 



Gly Ser Met Ser 
Gly Arg 
Lys Thr 



Lys Val 
55 

Glu Lys 

70 

Thr Pro 

Glu He 

Leu Leu 

Thr Glu 
135 
He Leu 
150 

Asp Tyr 
Phe Gly 



Gin Glu 

25 

Gly Ala 

40 
Phe His 



Arg Ser 

Gly He 

He Arg 
105 
Val Val 
120 

Lys He 
He Phe 

Leu Arg 

Asp Arg 
185 



Phe Asn Pro 
10 

Pro Arg Asn 

Gly Lys Ser 

Ser Gly Thr 
60 

Ser Ser Trp 
75 

Phe Asp Thr 
90 

Cys He Leu 

Pro Leu Gly 

Leu Lys Met 
140 

Thr Arg Lys 
155 

Glu Ala Pro 
170 

Tyr Cys Ala 



Ser Thr 

Ser Gin 
30 

Ala Thr 
45 

Ala Ala 

Lys Glu 

Glu Val 

Leu Thr 
110 
Arg Tyr 
125 

Phe Gly 

Asp Asp 

Glu Asp 

Leu Asn 
190 



Pro 
15 
Leu 

Gly 

Lys 

Thr 

Pro 
95 
Ser 

Thr 

Glu 

Leu 

He 
175 
Asn 



Gly 

Arg 

Asn 

Ser 

Glu 
80 
Asn 

Pro 

Glu 

Arg 

Gly 
160 
Gin 

Lys 
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Ala Thr Gly Ala Glu Gin Glu Ala Gin Arg Ala Gin Leu Leu Gly Leu 

195 200 205 

lie Gin Arg Val Val Arg Glu Asn Lys Glu Gly Cys Tyr Thr Asn Arg 

210 215 220 

Met Tyr Gin Arg Ala Glu Glu Glu lie Gin Lys Gin Thr Gin Ala Met 
225 230 235 240 

Gin Glu Leu His Arg Val Glu Leu Glu Arg Glu Lys Ala Arg lie Arg 

245 250 255 

Glu Glu Tyr Glu Glu Lys He Arg Lys Leu Glu Asp Lys Val Glu Gin 

260 265 270 

Glu Lys Arg Lys Lys Gin Met Glu Lys Lys Leu Ala Glu Gin Glu Ala 

275 280 285 

His Tyr Ala Val Arg Gin Gin Arg Ala Arg Thr Glu Val Glu Ser Lys 

290 295 300 

Asp Gly He Leu Glu Leu He Met Thr Ala Leu Gin He Ala Ser Phe 
305 310 315 320 

He Leu Leu Arg Leu Phe Ala Glu Asp 
325 



<210> 6 
<211> 987 
<2 12> DNA 

< 2 13 > Homo sapiens 



<400> 6 

atggcagccc aatacggcag tatgagcttc aaccccagca caccaggggc cagttatggg 60 

cctggaaggc aagagcccag aaattcccaa ttgagaattg tgttagtggg taaaaccgga 120 

gcaggaaaaa gtgcaacagg aaacagcatc cttggccgga aagtgt tt ca ttctggcact 18 0 

gcagcaaaat ccattaccaa gaagtgtgag aaacgcagca gctcatggaa ggaaacagaa 24 0 

cttgtcgtag ttgacacacc aggcattttc gacacagagg tgcccaatgc tgaaacgtcc 300 

aaggagatta ttcgctgcat tcttctgacc tccccagggc ctcatgctct gcttctggtg 360 

gt tccactgg gccgt tacac tgaggaagag cacaaagcca cagagaagat cctgaaaatg 420 

tttggagaga gggctagaag tttcatgatt ctcatattca cccggaaaga tgacttaggt 480 

gacaccaatt tgcatgacta cttaagggaa gctccagaag acattcaaga cttgatggac 540 

attttcggtg accgctactg tgcgttaaac aacaaggcaa caggcgctga gcaggaggcc 600 

cagagggcac agttgctggg cctgatccag cgcgtggtga gggagaacaa ggaaggctgc 660 

tacactaata ggatgtacca aagggcggag gaggagatcc agaagcaaac acaagcaatg 720 

caagaactcc acagagtgga gctggagaga gagaaagcgc ggataagaga ggagtatgaa 780 

gagaaaatca gaaagctgga agataaagtg gagcaggaaa agagaaagaa gcaaatggag 84 0 

aagaaactag cagaacagga ggctcactat gctgtaaggc agcaaagggc aagaacggaa 900 

gtggagagta aggatgggat actcgaatta atcatgacag cgttacagat tgcttccttt 960 

attttgttac gtctgttcgc ggaagat 987 



<210> 7 

<211> 4225 

<212> DNA 

<2 13> Homo sapiens 



<220> 
<221> CDS 

<222> (65) ... (3931) 



<400> 7 

gtcgacccac gcgtccgccc ggatgcccgg gccccgaggg gctgctggcg gcctggcccc 60 
tgag atg cgc ggg gcg ggg gcg gcg ggg ctg ctg gcg ctg ctg ctg ctg 109 
Met Arg Gly Ala Gly Ala Ala Gly Leu Leu Ala Leu Leu Leu Leu 
15 10 15 
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ctg ctg ctg ctg ctg ctg ggc ctg ggc ggc agg gtc gag ggg ggg ccg 157 
Leu Leu Leu Leu Leu Leu Gly Leu Gly Gly Arg Val Glu Gly Gly Pro 
20 25 30 

gcc ggc gag egg ggc gca ggc ggg ggc ggg gcg ctg gcc cgc gag cgc 205 
Ala Gly Glu Arg Gly Ala Gly Gly Gly Gly Ala Leu Ala Arg Glu Arg 
35 40 45 

ttc aag gtg gtc ttt gcg ccg gtg ate tge aag egg aec tgt etc aag 253 
Phe Lys Val Val Phe Ala Pro Val lie Cys Lys Arg Thr Cys Leu Lys 
50 55 60 

gge cag tgt egg gac agt tgt cag eag ggc tec aac atg acg etc ate 301 
Gly Gin Cys Arg Asp Ser Cys Gin Gin Gly Ser Asn Met Thr Leu lie 
65 70 75 

99^ 9^9 ciac ggc cae age aea gae aeg etc acg ggc tec ggc ttc cge 349 
Gly Glu Asn Gly His Ser Thr Asp Thr Leu Thr Gly Ser Gly Phe Arg 
80 85 90 95 

gtg gtg gtg tgc cct etc ccc tge atg aat ggc ggc eag tgc tec teg 397 
Val Val Val Cys Pro Leu Pro Cys Met Asn Gly Gly Gin Cys Ser Ser 
100 105 110 

cga aac eag tgc ctg tgt cce ecg gac ttc act ggg ege ttc tgc cag 445 
Arg Asn Gin Cys Leu Cys Pro Pro Asp Phe Thr Gly Arg Phe Cys Gin 

115 120 125 

gtg ccc gca gga gga gcc ggt ggg ggt ace gge gge tea ggc ccc ggc 493 
Val Pro Ala Gly Gly Ala Gly Gly Gly Thr Gly Gly Ser Gly Pro Gly 
130 135 140 

ctg age agg aea ggg gcc ctg tec aca ggg gcg ctg ecg cce ctg get 541 
Leu Ser Arg Thr Gly Ala Leu Ser Thr Gly Ala Leu Pro Pro Leu Ala 
145 150 155 

ccg gag ggc gac tct gtg gcc age aag cae gcc ate tae gee gte eag 589 
Pro Glu Gly Asp Ser Val Ala Ser Lys His Ala He Tyr Ala Val Gin 
160 165 170 175 

gtg ate get gae eet cct ggg ccc ggg gag ggg cct cct gcc cag cae 637 
Val He Ala Asp Pro Pro Gly Pro Gly Glu Gly Pro Pro Ala Gin His 
180 185 190 

gca gcc ttc ctg gtg ccc eta ggc ccg gga cag ate tea gca gaa gtg 685 
Ala Ala Phe Leu Val Pro Leu Gly Pro Gly Gin He Ser Ala Glu Val 
195 200 205 

cag gee ccg ccc ccc gtg gtg aat gtg ege gtc cat cae ccg ccc gag 733 
Gin Ala Pro Pro Pro Val Val Asn Val Arg Val His His Pro Pro Glu 
210 215 220 

gcc tea gtc eag gtg cac cgc att gag age teg aac gee gag age gca 781 
Ala Ser Val Gin Val His Arg He Glu Ser Ser Asn Ala Glu Ser Ala 
225 230 235 
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gcc ccc tec cag cac ctg ctg ccg cac ccc aag ccc teg cac cct egg 829 
Ala Fro Ser Gin His Leu Leu Pro His Pro Lys Pro Ser His Pro Arg 
240 245 250 255 

ccg ccc acc cag aag ccc ctg ggc cgc tgc ttt cag gac act ctg ccc S^7 
Pro Pro Thr Gin Lys Pro Leu Gly Arg Cys Phe Gin Asp Thr Leu Pro 
260 265 270 

aag cag ccg tgt ggc age aac eec etc ccc ggc etc ace aag cag gaa 925 
Lys Gin Pro Cys Gly Ser Asn Pro Leu Pro Gly Leu Thr Lys Gin Glu 
275 280 285 

gac tgc tge ggt age ate ggc act gee tgg ggc cag age aag tge cac 973 
Asp Cys Cys Gly Ser lie Gly Thr Ala Trp Gly Gin Ser Lys Cys His 
290 295 300 

aag tgt ccc cag ctg cag tac aca gga gtg eag aag cca ggg ect gta 1021 
Lys Cys Pro Gin Leu Gin Tyr Thr Gly Val Gin Lys Pro Gly Pro Val 
305 310 315 

cgt ggg gaa gtg ggc get gac tgt ccc cag ggc tac aag agg ctt aac 1069 
Arg Gly Glu Val Gly Ala Asp Cys Pro Gin Gly Tyr Lys Arg Leu Asn 
320 325 330 335 

age acc cac tgc eag gac ate aac gag tgc gca atg ccg ggc gtg tgt 1117 
Ser Thr His Cys Gin Asp lie Asn Glu Cys Ala Met Pro Gly Val Cys 
340 345 350 

cgc cat ggt gac tgc etc aac aac ect ggc tec tat ege tgt gtc tge 1165 
Arg His Gly Asp Cys Leu Asn Asn Pro Gly Ser Tyr Arg Cys Val Cys 
355 360 365 

cea cct ggc cat agt tta ggc ccc tec cgt aca cag tgc att gca gac 1213 
Pro Pro Gly His Ser Leu Gly Pro Ser Arg Thr Gin Cys lie Ala Asp 
370 375 380 

aaa ccg gag gag aag age ctg tgt ttc cgc ctg gtg age cct gag cac 1261 
Lys Pro Glu Glu Lys Ser Leu Cys Phe Arg Leu Val Ser Pro Glu His 
385 390 395 

cag tgc cag cac eca ctg ace acc cgc ctg acc cgc cag etc tgc tgc 1309 
Gin Cys Gin His Pro Leu Thr Thr Arg Leu Thr Arg Gin Leu Cys Cys 
400 405 410 415 

tgc agt gtc ggc aag gcc tgg ggc gcg egg tgt eag cgc tgc cca aca 1357 
Cys Ser Val Gly Lys Ala Trp Gly Ala Arg Cys Gin Arg Cys Pro Thr 
420 425 430 

gat ggc acc get gcg ttc aag gag ate tgc cca get ggg aag gga tac 1405 
Asp Gly Thr Ala Ala Phe Lys Glu lie Cys Pro Ala Gly Lys Gly Tyr 
435 440 445 

eae att etc acc tec cac cag acg etc aee att eag ggc gag agt gac 1453 
His He Leu Thr Ser His Gin Thr Leu Thr He Gin Gly Glu Ser Asp 
450 455 460 
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ttt tec ctt ttc ctg cac cct gac ggg cca ccc aag ccc cag cag ctt 1501 
Phe Ser Leu Phe Leu His Pro Asp Gly Pro Pro Lys Pro Gin Gin Leu 
465 470 475 

ccg gag age cct age eag get cca cca cct gag gac aca gag gaa gag 1549 
Pro Giu Ser Pro Ser Gin Ala Pro Pro Pro Glu Asp Thr Glu Glu Glu 
480 485 490 495 

sga ggg gtg acc acg gac tea ccg gtg agt gag gag agg tea gtg cag 1597 
Arg Gly Val Thr Thr Asp Ser Pro Val Ser Glu Glu Arg Ser Val Gin 
500 505 510 

cag age cac cca act gee acc acg act cct gee egg ccc tac ccc gag 1645 
Gin Ser His Pro Thr Ala Thr Thr Thr Pro Ala Arg Pro Tyr Pro Glu 
515 520 525 

ctg ate tee cgt ccc teg ccc ccg ace atg cgc tgg ttc ctg ccg gac 1693 
Leu lie Ser Arg Pro Ser Pro Pro Thr Met Arg Trp Phe Leu Pro Asp 
530 535 540 

ttg cct cct tec cgc age gee gta gag ate get ccc act eag gtc aca 1741 
Leu Pro Pro Ser Arg Ser Ala Val Glu lie Ala Pro Thr Gin Val Thr 
545 550 555 

gag act gat gag tge cga ctg aac cag aac ate tgt ggc cac gga gag 1789 
Glu Thr Asp Glu Cys Arg Leu Asn Gin Asn lie Cys Gly His Gly Glu 
560 565 570 575 

tge gtg ccg ggc ccc cct gac tac tec tge cac tgc aac ccc gge tac 1837 
Cys Val Pro Gly Pro Pro Asp Tyr Ser Cys His Cys Asn Pro Gly Tyr 
580 585 590 

egg tea cat ccc cag cac cgc tac tgc gtg gat gtg aac gag tgc gag 1885 
Arg Ser His Pro Gin His Arg Tyr Cys Val Asp Val Asn Glu Cys Glu 
595 600 605 

gca gag ccc tgt ggc ccg ggg agg ggc ate tge atg aac acc gge ggc 1933 
Ala Glu Pro Cys Gly Pro Gly Arg Gly He Cys Met Asn Thr Gly Gly 
610 615 620 

tec tae aat tgc cac tgc aac cgc ggc tae cgc ctg cac gtg ggc gee 1981 
Ser Tyr Asn Cys His Cys Asn Arg Gly Tyr Arg Leu His Val Gly Ala 
625 630 635 

999 ggg cgc teg tgc gtg gae ctg aac gaa tge gee aag ccc eac ctg 2029 
Gly Gly Arg Ser Cys Val Asp Leu Asn Glu Cys Ala Lys Pro His Leu 
640 645 650 655 

tgc ggc gae ggc ggc ttc tgc ate aac ttt ccc ggt cac tac aag tgc 2077 
Cys Gly Asp Gly Gly Phe Cys He Asn Phe Pro Gly His Tyr Lys Cys 
660 665 670 

aac tgc tac ccc ggc tac egg etc aaa gee tee egg cct cct gtg tgc 2125 
Asn Cys Tyr Pro Gly Tyr Arg Leu Lys Ala Ser Arg Pro Pro Val Cys 
675 680 685 
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gaa gac ate gac gag tgc egg gac cca age tct tgc ccg gat ggc aaa 2173 
Glu Asp lie Asp Glu Cys Arg Asp Pro Ser Ser Cys Pro Asp Gly Lys 
590 695 700 

tgc gag aac aag ccc ggg age ttc aag tge ate gee tgt eag act ggc 2221 
Cys Glu Asn Lys Pro Gly Ser Phe Lys Cys lie Ala Cys Gin Pro Gly 
705 710 715 

tac cgc age cag ggg ggc ggg gee tgt egc gac gtg aac gag tgc gee 2269 
Tyr Arg Ser Gin Gly Gly Gly Ala Cys Arg Asp Val Asn Glu Cys Ala 
720 725 730 735 

gag ggc age ccc tgc teg cet ggc tgg tge gag aac etc ccg ggc tec 2317 
Glu Gly Ser Pro Cys Ser Pro Gly Trp Cys Glu Asn Leu Pro Gly Ser 
740 745 750 

ttc cgc tgc ace tgt gcc cag ggc tac gcg ccc geg ccc gac ggc cgc 2365 
Phe Arg Cys Thr Cys Ala Gin Gly Tyr Ala Pro Ala Pro Asp Gly Arg 
755 760 765 

agt tgc ttg gat gtg gac gag tgt gag get ggg gac gtg tgt gac aat 2413 
Ser Cys Leu Asp Val Asp Glu Cys Glu Ala Gly Asp Val Cys Asp Asn 
770 775 780 

ggc ate tgc age aac acg cca gga tct ttc cag tgt cag tgc etc tct 2461 
Gly Tie Cys Ser Asn Thr Pro Gly Ser Phe Gin Cys Gin Cys Leu Ser 
785 790 795 

ggc tac cat ctg tec agg gac egg age cac tgc gag gac att gat gag 2509 
Gly Tyr His Leu Ser Arg Asp Arg Ser His Cys Glu Asp lie Asp Glu 
800 805 810 815 

tgt gac ttc cct gca gcc tgc att ggg ggt gac tgc ate aat ace aat 2557 
Cys Asp Phe Pro Ala Ala Cys lie Gly Gly Asp Cys lie Asn Thr Asn 
820 825 830 

ggc tec tac aga tgt ctt tgc ccc cag ggg eat egg ctg gtg ggt ggc 2605 
Gly Ser Tyr Arg Cys Leu Cys Pro Gin Gly His Arg Leu Val Gly Gly 
835 840 845 

agg aaa tgc caa gac ata gat gag tgc age cag gac ccg age ctg tgc 2653 
Arg Lys Cys Gin Asp lie Asp Glu Cys Ser Gin Asp Pro Ser Leu Cys 
850 855 860 

ctt ccc cat ggg gcc tgc aag aac ctt cag ggc tec tat gtg tgt gtc 2701 
Leu Pro His Gly Ala Cys Lys Asn Leu Gin Gly Ser Tyr Val Cys Val 
865 870 875 

tgc gat gag ggc ttc act ccc acc cag gac cag cac ggt tgt gag gag 2749 
Cys Asp Glu Gly Phe Thr Pro Thr Gin Asp Gin His Gly Cys Glu Glu 
880 885 890 895 

gtg gag eag ccc cac cac aag aag gag tgc tac ctg aac ttc gat gac 2797 
Val Glu Gin Pro His His Lys Lys Glu Cys Tyr Leu Asn Phe Asp Asp 
900 905 910 
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aca gtg ttc tgc gac age gta ttg gcc acc aac gtg acc cag cag gag 
Thr Val Phe Cys Asp Ser Val Leu Ala Thr Asn Val Thr Gin Gin Glu 
915 920 925 



^845 



tgc tgc tgc tct ctg ggg gcc ggc tgg ggc gac cac tgc gaa ate tac 
Cys Cys Cys Ser Leu Gly Ala Gly Trp Gly Asp His Cys Glu lie Tyr 
930 935 940 



5893 



GCC tgc cca gtc tac age tea gcc gag ttc cac age etc tgc cca gac 
Pro Cys Pro Val Tyr Ser Ser Ala Glu Phe His Ser Leu Cys Pro Asp 
945 950 955 



:^941 



gga aag ggc tac acc cag gac aac aac ate gtc aac tac ggc ate cca 
Gly Lys Gly Tyr Thr Gin Asp Asn Asn lie Val Asn Tyr Gly He Pro 

960 965 970 975 



2989 



gcc cac cgt gac ate gac gag tgc atg ttg ttc ggg teg gag att tgc 
Ala His Arg Asp He Asp Glu Cys Met Leu Phe Gly Ser Glu He Cys 
980 985 990 



3037 



aag gag ggc aag tgc gtg aac acg cag cct ggc tac gag tgc tac tgc 
Lys Glu Gly Lys Cys Val Asn Thr Gin Pro Gly Tyr Glu Cys Tyr Cys 
995 1000 1005 



3085 



aag cag ggc ttc tac tac gac ggg aac ctg ctg gaa tgc gtg gac gtg 
Lys Gin Gly Phe Tyr Tyr Asp Gly Asn Leu Leu Glu Cys Val Asp Val 
1010 1015 1020 



3133 



gac gag tgc ctg gac gag tec aac tgc egg aac gga gtg tgt gag aac 
Asp Glu Cys Leu Asp Glu Ser Asn Cys Arg Asn Gly Val Cys Glu Asn 
1025 1030 1035 



3181 



acg cgc ggc ggc tac cgc tgt gcc tgc acg cec cct gee gag tac agt 
Thr Arg Gly Gly Tyr Arg Cys Ala Cys Thr Pro Pro Ala Glu Tyr Ser 
1040 1045 1050 1055 



322 9 



ccc gcg cag cgc cag tgc ctg age ccg gaa gag atg gac gtg gac gag 
Pro Ala Gin Arg Gin Cys Leu Ser Pro Glu Glu Met Asp Val Asp Glu 
1060 1065 1070 



32V7 



tgc cag gac ccg gca gcc tgc cgc cct ggc cgc tgc gtc aac ctg ccg 
Cys Gin Asp Pro Ala Ala Cys Arg Pro Gly Arg Cys Val Asn Leu Pro 
1075 1080 1085 



3325 



ggc tec tac cgc tgc gag tgt cgc ccg ccc tgg gtg ccc ggg ccc tec 
Gly Ser Tyr Arg Cys Glu Cys Arg Pro Pro Trp Val Pro Gly Pro Ser 
1090 1095 1100 



3373 



ggc cgc gat tgc cag etc ccc gag age ccg gcc gag cgt gcc ccg gag 
Gly Arg Asp Cys Gin Leu Pro Glu Ser Pro Ala Glu Arg Ala Pro Glu 
1105 1110 1115 



3421 



egg cgc gac gtg tgc tgg age cag cgc gga gag gac ggc atg tgc get 
Arg Arg Asp Val Cys Trp Ser Gin Arg Gly Glu Asp Gly Met Cys Ala 
1120 1125 1130 1135 



3469 
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ggc ccc ctg gcc ggg cct gcc etc acc ttc gac gac tgc tgc tgc cgc 3517 

Gly Pro Leu Ala Gly Pro Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg 
1140 1145 1150 

99c cgc ggc tgg ggc gcc caa tgc cga ccg tgc ccg ccg cgc ggc 3565 

Gin Gly Arg Gly Trp Gly Ala Gin Cys Arg Pro Cys Pro Pro Arg Gly 

1155 1160 1165 



gcg ggg tec cat tgc ccg aca teg cag age gag age aat tec ttc tgg 
Ala Gly Ser His Cys Pro Thr Ser Gin Ser Glu Ser Asn Ser Phe Trp 
1170 1175 1180 



3613 



gac dCd age ccc etg ctg ttg ggg aag ccc cca aga gat gag gac agt 
Asp Thr Ser Pro Leu Leu Leu Gly Lys Pro Pro Arg Asp Glu Asp Ser 
1185 1190 1195 



3661 



tea gag gag gat tea gac gag tgt cgc tgc gtg agt ggc cgc tgc gtg 
Ser Glu Glu Asp Ser Asp Glu Cys Arg Cys Val Ser Gly Arg Cys Val 
1200 1205 1210 1215 



3709 



ccg egg ccg ggc ggc gcc gcg tgc gag tgt ccc ggc ggc ttc cag etc 
Pro Arg Pro Gly Gly Ala Ala Cys Glu Cys Pro Gly Gly Phe Gin Leu 
1220 1225 1230 



3757 



gac gee tec cgc gcc cgc tgc gtg gat ate gac gag tgc cga gag ctg 
Asp Ala Ser Arg Ala Arg Cys Val Asp lie Asp Glu Cys Arg Glu Leu 
1235 1240 1245 



3805 



aac cag cgc ggg ctg ctg tgc aag age gag cgc tgc gtg aac acc age 
Asn Gin Arg Gly Leu Leu Cys Lys Ser Glu Arg Cys Val Asn Thr Ser 
1250 1255 1260 



3853 



ggc tec ttc cgc tgc gt c tgc aaa gcc ggc ttc gcg cgc age cgc ccg 
Gly Ser Phe Arg Cys Val Cys Lys Ala Gly Phe Ala Arg Ser Arg Pro 
1265 1270 1275 



3901 



cac ggg gcc tgc gtt ccc cag cgc egc cgc tgacgeegcc gacgccgccc 
His Gly Ala Cys Val Pro Gin Arg Arg Arg 
1280 1285 



3951 



tcggcceaga ccteggtgat cactgaggga tttccgcgag cteggeetca cttctgcccc 4011 

gacttgtgge teggacccag ggacctteag ggcccgcaga eeetcecggc gccttgagac 4071 

ccgaggcgee ectaceggcc cccctccccg gttagcgggc ggttgtaagg tctccggcgg 4131 

gcgctgcctg ccttcetccc agagggtgtt tcctagaaac tgataaatca gatcgtgcct 4191 

ctttaaaaaa aaaaaaaaaa aaaagggcgg ccgc 42 2 5 

<210> 8 
<211> 1289 
<212> PRT 

<2 13 > Homo sapiens 



<400> 8 

Met Arg Gly Ala Gly Ala Ala Gly Leu Leu Ala Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Leu Leu Leu Gly Leu Gly Gly Arg Val Glu Gly Gly Pro Ala 

20 25 30 

Gly Glu Arg Gly Ala Gly Gly Gly Gly Ala Leu Ala Arg Glu Arg Phe 
35 40 45 
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Lys Val Val Phe Ala Pro Val lie Cys Lys Arg Thr Cys Lea Lys Gly 

50 55 60 

Gin Cys Arg Asp Ser Cys Gin Gin Gly Ser Asn Met Thr Leu lie Gly 
65 70 75 80 

Glu Asn Gly His Ser Thr Asp Thr Leu Thr Gly Ser Gly Phe Arg Val 

85 90 95 

Val Val Cys Pro Leu Pro Cys Met Asn Gly Gly Gin Cys Ser Ser Arg 

100 105 110 

Asn Gin Cys Leu Cys Pro Pro Asp Phe Thr Gly Arg Phe Cys Gin Val 

115 120 125 

Pro Ala Gly Gly Ala Gly Gly Gly Thr Gly Gly Ser Gly Pro Gly Leu 

130 135 140 

Ser A.rg Thr Gly Ala Leu Ser Thr Gly Ala Leu Pro Pro Leu Ala Pro 
145 150 155 160 

Glu Gly Asp Ser Val Ala Ser Lys His Ala lie Tyr Ala Val Gin Val 

165 170 175 

lie Ala Asp Pro Pro Gly Pro Gly Glu Gly Pro Pro Ala Gin His Ala 

180 185 190 

Ala Phe Leu Val Pro Leu Gly Pro Gly Gin lie Ser Ala Glu Val Gin 

195 200 205 

Ala Pro Pro Pro Val Val Asn Val Arg Val His His Pro Pro Glu Ala 

210 215 220 

Ser Val Gin Val His Arg lie Glu Ser Ser Asn Ala Glu Ser Ala Ala 
225 230 235 240 

Pro Ser Gin His Leu Leu Pro His Pro Lys Pro Ser His Pro Arg Pro 

245 250 255 

Pro Thr Gin Lys Pro Leu Gly Arg Cys Phe Gin Asp Thr Leu Pro Lys 

260 265 270 

Gin Pro Cys Gly Ser Asn Pro Leu Pro Gly Leu Thr Lys Gin Glu Asp 

275 280 285 

Cys Cys Gly Ser lie Gly Thr Ala Trp Gly Gin Ser Lys Cys His Lys 

290 295 300 

Cys Pro Gin Leu Gin Tyr Thr Gly Val Gin Lys Pro Gly Pro Val Arg 
305 310 315 320 

Gly Glu Val Gly Ala Asp Cys Pro Gin Gly Tyr Lys Arg Leu Asn Ser 

325 330 335 

Thr His Cys Gin Asp He Asn Glu Cys Ala Met Pro Gly Val Cys Arg 

340 345 350 

His Gly Asp Cys Leu Asn Asn Pro Gly Ser Tyr Arg Cys Val Cys Pro 

355 360 365 

Pro Gly His Ser Leu Gly Pro Ser Arg Thr Gin Cys He Ala Asp Lys 

370 375 380 

Pro Glu Glu Lys Ser Leu Cys Phe Arg Leu Val Ser Pro Glu His Gin 
385 390 395 400 

Cys Gin His Pro Leu Thr Thr Arg Leu Thr Arg Gin Leu Cys Cys Cys 

405 410 415 

Ser Val Gly Lys Ala Trp Gly Ala Arg Cys Gin Arg Cys Pro Thr Asp 

420 425 430 

Gly Thr Ala Ala Phe Lys Glu He Cys Pro Ala Gly Lys Gly Tyr His 

435 440 445 

He Leu Thr Ser His Gin Thr Leu Thr He Gin Gly Glu Ser Asp Phe 

450 455 460 

Ser Leu Phe Leu His Pro Asp Gly Pro Pro Lys Pro Gin Gin Leu Pro 
465 470 475 480 

Glu Ser Pro Ser Gin Ala Pro Pro Pro Glu Asp Thr Glu Glu Glu Arg 

485 490 495 

Gly Val Thr Thr Asp Ser Pro Val Ser Glu Glu Arg Ser Val Gin Gin 



500 



505 



510 
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Ser His Pro Thr Ala Thr Thr Thr Pro Ala Arg Pro Tyr Pro Glu Leu 

515 520 525 

lie Ser Arg Pro Ser Pro Pro Thr Met Arg Trp Phe Leu Pro Asp Leu 

530 535 540 

Pro Pro Ser Arg Ser Ala Val Glu He Ala Pro Thr Gin Val Thr Glu 
545 550 555 560 

Thr Asp Glu Cys Arg Leu Asn Gin Asn He Cys Gly His Gly Glu Cys 

565 570 575 

Val Pro Gly Pro Pro Asp Tyr Ser Cys His Cys Asn Pro Gly Tyr Arg 

580 585 590 

Ser His Pro Gin His Arg Tyr Cys Val Asp Val Asn Glu Cys Glu Ala 

595 600 605 

Glu Pro Cys Gly Pro Gly Arg Gly He Cys Met Asn Thr Gly Gly Ser 

610 615 620 

Tyr Asn Cys His Cys Asn Arg Gly Tyr Arg Leu His Val Gly Ala Gly 
625 630 635 640 

Gly Arg Ser Cys Val Asp Leu Asn Glu Cys Ala Lys Pro His Leu Cys 

645 650 655 

Gly Asp Gly Gly Phe Cys He Asn Phe Pro Gly His Tyr Lys Cys Asn 

660 665 670 

Cys Tyr Pro Gly Tyr Arg Leu Lys Ala Ser Arg Pro Pro Val Cys Glu 

675 680 685 

Asp He Asp Glu Cys Arg Asp Pro Ser Ser Cys Pro Asp Gly Lys Cys 

690 695 700 

Glu Asn Lys Pro Gly Ser Phe Lys Cys He Ala Cys Gin Pro Gly Tyr 
705 710 715 720 

Arg Ser Gin Gly Gly Gly Ala Cys Arg Asp Val Asn Glu Cys Ala Glu 

725 730 735 

Gly Ser Pro Cys Ser Pro Gly Trp Cys Glu Asn Leu Pro Gly Ser Phe 

740 745 750 

Arg Cys Thr Cys Ala Gin Gly Tyr Ala Pro Ala Pro Asp Gly Arg Ser 

755 760 765 

Cys Leu Asp Val Asp Glu Cys Glu Ala Gly Asp Val Cys Asp Asn Gly 

770 775 780 

He Cys Ser Asn Thr Pro Gly Ser Phe Gin Cys Gin Cys Leu Ser Gly 
785 790 795 800 

Tyr His Leu Ser Arg Asp Arg Ser His Cys Glu Asp He Asp Glu Cys 

805 810 815 

Asp Phe Pro Ala Ala Cys He Gly Gly Asp Cys He Asn Thr Asn Gly 

820 825 830 

Ser Tyr Arg Cys Leu Cys Pro Gin Gly His Arg Leu Val Gly Gly Arg 

835 840 845 

Lys Cys Gin Asp He Asp Glu Cys Ser Gin Asp Pro Ser Leu Cys Leu 

850 855 860 

Pro His Gly Ala Cys Lys Asn Leu Gin Gly Ser Tyr Val Cys Val Cys 
865 870 875 880 

Asp Glu Gly Phe Thr Pro Thr Gin Asp Gin His Gly Cys Glu Glu Val 

885 890 895 

Glu Gin Pro His His Lys Lys Glu Cys Tyr Leu Asn Phe Asp Asp Thr 

900 905 910 

Val Phe Cys Asp Ser Val Leu Ala Thr Asn Val Thr Gin Gin Glu Cys 

915 920 925 

Cys Cys Ser Leu Gly Ala Gly Trp Gly Asp His Cys Glu He Tyr Pro 

930 935 940 

Cys Pro Val Tyr Ser Ser Ala Glu Phe His Ser Leu Cys Pro Asp Gly 
945 950 955 960 

Lys Gly Tyr Thr Gin Asp Asn Asn He Val Asn Tyr Gly He Pro Ala 



965 



970 



975 
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His Arg Asp lie Asp Glu Cys Met Leu Phe Gly Ser Glu He Cys Lys 

980 985 990 

Glu Gly Lys Cys Val Asn Thr Gin Pro Gly Tyr Glu Cys Tyr Cys Lys 

995 1000 1005 

Gin Gly Phe Tyr Tyr Asp Gly Asn Leu Leu Glu Cys Val Asp Val Asp 

1010 1015 1020 

Glu Cys Leu Asp Glu Ser Asn Cys Arg Asn Gly Val Cys Glu Asn Thr 
1025 1030 1035 1040 

Arg Gly Gly Tyr Arg Cys Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro 

1045 1050 1055 

Ala Gin Arg Gin Cys Leu Ser Pro Glu Glu Met Asp Val Asp Glu Cys 

1060 1065 1070 

Gin Asp Pro Ala Ala Cys Arg Pro Gly Arg Cys Val Asn Leu Pro Gly 

1075 1080 1085 

Ser Tyr Arg Cys Glu Cys Arg Pro Pro Trp Val Pro Gly Pro Ser Gly 

1090 1095 1100 

Arg Asp Cys Gin Leu Pro Glu Ser Pro Ala Glu Arg Ala Pro Glu Arg 
1105 1110 1115 1120 

Arg Asp Val Cys Trp Ser Gin Arg Gly Glu Asp Gly Met Cys Ala Gly 

1125 1130 1135 

Pro Leu Ala Gly Pro Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg Gin 

1140 1145 1150 

Gly Arg Gly Trp Gly Ala Gin Cys Arg Pro Cys Pro Pro Arg Gly Ala 

1155 1160 1165 

Gly Ser His Cys Pro Thr Ser Gin Ser Glu Ser Asn Ser Phe Trp Asp 

1170 1175 1180 

Thr Ser Pro Leu Leu Leu Gly Lys Pro Pro Arg Asp Glu Asp Ser Ser 
1185 1190 1195 1200 

Glu Glu Asp Ser Asp Glu Cys Arg Cys Val Ser Gly Arg Cys Val Pro 

1205 1210 1215 

Arg Pro Gly Gly Ala Ala Cys Glu Cys Pro Gly Gly Phe Gin Leu Asp 

1220 1225 1230 

Ala Ser Arg Ala Arg Cys Val Asp He Asp Glu Cys Arg Glu Leu Asn 

1235 1240 1245 

Gin Arg Gly Leu Leu Cys Lys Ser Glu Arg Cys Val Asn Thr Ser Gly 

1250 1255 1260 

Ser Phe Arg Cys Val Cys Lys Ala Gly Phe Ala Arg Ser Arg Pro His 
1265 1270 1275 1280 

Gly Ala Cys Val Pro Gin Arg Arg Arg 
1285 

<210> 9 

<211> 3867 

<212> DNA 

<213> Homo sapiens 

<400> 9 

atgcgcgggg cgggggcggc ggggctgctg gcgctgctgc tgctgctgct gctgctgctg 60 

ctgggcctgg gcggcagggt cgaggggggg ccggccggcg agcggggcgc aggcgggggc 120 

ggggcgctgg cccgcgagcg cttcaaggtg gtctttgcgc cggtgatctg caagcggacc 180 

tgtctcaagg gccagtgtcg ggacagttgt cagcagggct ccaacatgac gctcatcgga 240 

gagaacggcc acagcacaga cacgctcacg ggctccggct tccgcgtggt ggtgtgccct 300 

ctcccctgca tgaatggcgg ccagtgctcc tcgcgaaacc agtgcctgtg tcccccggac 360 

ttcactgggc gcttctgcca ggtgcccgca ggaggagccg gtgggggtac cggcggctca 420 

ggccccggcc tgagcaggac aggggccctg tccacagggg cgctgccgcc cctggctccg 4 80 

gagggcgact ctgtggccag caagcacgcc atctacgccg tccaggtgat cgctgaccct 540 

cctgggcccg gggaggggcc tcctgcccag cacgcagcct tcctggtgcc cctaggcccg 600 

ggacagatct cagcagaagt gcaggccccg ccccccgtgg tgaatgtgcg cgtccatcac 660 
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ccgcccgagg cctcagtcca ggtgcaccgc attgagagct cgaacgccga gagcgcagcc 720 

ccctcccagc acctgctgcc gcaccccaag ccctcgcacc ctcggccgcc cacccagaag 780 

cccctgggcc gctgctttca ggacactctg cccaagcagc cgtgtggcag caaccccctc 840 

cccggcctca ccaagcagga agactgctgc ggtagcatcg gcactgcctg gggccagagc 900 

aagtgccaca agtgtcccca gctgcagtac acaggagtgc agaagccagg gcctgtacgt 9*-50 

ggggaagtgg gcgctgactg tccccagggc tacaagaggc ttaacagcac ccactgccag 1020 

gacatcaacg agtgcgcaat gccgggcgtg tgtcgccatg gtgactgcct caacaaccct 1080 

ggctcctatc gctgtgtctg cccacctggc catagtttag gcccctcccg tacacagtgc 1140 

attgcagaca aaccggagga gaagagcctg tgtttccgcc tggtgagccc tgagcaccag 1200 

tgccagcacc cactgaccac ccgcctgacc cgccagctct gctgctgcag tgtcggcaag 1260 

gcctggggcg cgcggtgtca gcgctgccca acagatggca ccgctgcgtt caaggagatc 1320 

tgcccagctg ggaagggata ccacattctc acctcccacc agacgctcac cattcagggc 1380 

gagagtgact tttccctctt cctgcaccct gacgggccac ccaagcccca gcagcttccg 1440 

gagagcccta gccaggctcc accacctgag gacacagagg aagagagagg ggtgaccacg 1500 

gactcaccgg tgagtgagga gaggtcagtg cagcagagcc acccaactgc caccacgact 1560 

cctgcccggc cctaccccga gctgatctcc cgtccctcgc ccccgaccat gcgctggttc 1620 

ctgccggact tgcctccttc ccgcagcgcc gtagagatcg ctcccactca ggtcacagag 1680 

actgatgagt gccgactgaa ccagaacatc tgtggccacg gagagtgcgt gccgggcccc 1740 

cctgactact cctgccactg caaccccggc taccggtcac atccccagca ccgctactgc 1800 

gtggatgtga acgagtgcga ggcagagccc tgtggcccgg ggaggggcat ctgcatgaac 1860 

accggcggct cctacaattg ccactgcaac cgcggctacc gcctgcacgt gggcgccggg 1920 

gggcgctcgt gcgtggacct gaacgaatgc gccaagcccc acctgtgcgg cgacggcggc 1980 

ttctgcatca actttcccgg tcactacaag tgcaactgct accccggcta ccggctcaaa 2040 

gcctcccggc ctcctgtgtg cgaagacatc gacgagtgcc gggacccaag ctcttgcccg 2100 

gatggcaaat gcgagaacaa gcccgggagc ttcaagtgca tcgcctgtca gcctggctac 2160 

cgcagccagg ggggcggggc ctgtcgcgac gtgaacgagt gcgccgaggg cagcccctgc 2220 

tcgcctggct ggtgcgagaa cctcccgggc tccttccgct gcacctgtgc ccagggctac 2280 

gcgcccgcgc ccgacggccg cagttgcttg gatgtggacg agtgtgaggc tggggacgtg 2340 

tgtgacaatg gcatctgcag caacacgcca ggatctttcc agtgtcagtg cctctctggc 2400 

taccatctgt ccagggaccg gagccactgc gaggacattg atgagtgtga cttccctgca 2460 

gcctgcattg ggggtgactg catcaatacc aatggctcct acagatgtct ttgcccccag 2520 

gggcatcggc tggtgggtgg caggaaatgc caagacatag atgagtgcag ccaggacccg 2580 

agcctgtgcc ttccccatgg ggcctgcaag aaccttcagg gctcctatgt gtgtgtctgc 2640 

gatgagggct tcactcccac ccaggaccag cacggttgtg aggaggtgga gcagccccac 2700 

cacaagaagg agtgctacct gaacttcgat gacacagtgt tctgcgacag cgtattggcc 2760 

accaacgtga cccagcagga gtgctgctgc tctctggggg ccggctgggg cgaccactgc 2820 

gaaatctacc cctgcccagt ctacagctca gccgagttcc acagcctctg cccagacgga 2880 

aagggctaca cccaggacaa caacatcgtc aactacggca tcccagccca ccgtgacatc 2940 

gacgagtgca tgttgttcgg gtcggagatt tgcaaggagg gcaagtgcgt gaacacgcag 3000 

cctggctacg agtgctactg caagcagggc ttctactacg acgggaacct gctggaatgc 3060 

gtggacgtgg acgagtgcct ggacgagtcc aactgccgga acggagtgtg tgagaacacg 3120 

cgcggcggct accgctgtgc ctgcacgccc cctgccgagt acagtcccgc gcagcgccag 3180 

tgcctgagcc cggaagagat ggacgtggac gagtgccagg acccggcagc ctgccgccct 3240 

ggccgctgcg tcaacctgcc gggctcctac cgctgcgagt gtcgcccgcc ctgggtgccc 3300 

gggccctccg gccgcgattg ccagctcccc gagagcccgg ccgagcgtgc cccggagcgg 3360 

cgcgacgtgt gctggagcca gcgcggagag gacggcatgt gcgctggccc cctggccggg 3420 

cctgccctca ccttcgacga ctgctgctgc cgccagggcc gcggctgggg cgcccaatgc 3480 

cgaccgtgcc cgccgcgcgg cgcggggtcc cattgcccga catcgcagag cgagagcaat 3540 

tccttctggg acacaagccc cctgctgttg gggaagcccc caagagatga ggacagttca 3600 

gaggaggatt cagacgagtg tcgctgcgtg agtggccgct gcgtgccgcg gccgggcggc 3660 

gccgcgtgcg agtgtcccgg cggcttccag ctcgacgcct cccgcgcccg ctgcgtggat 3720 

atcgacgagt gccgagagct gaaccagcgc gggctgctgt gcaagagcga gcgctgcgtg 3780 

aacaccagcg gctccttccg ctgcgtctgc aaagccggct tcgcgcgcag ccgcccgcac 3840 

ggggcctgcg ttccccagcg ccgccgc 3867 



<210> 10 
<211> 4316 
<212> DNA 
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:213> Mus musculus 

;220> 
:221> CDS 

:222> (157) . . . (3915) 
:400> 10 

cctcctgctg tcccctccct acccttggct tctcgccccg ctctgccccc tgctaccaac 60 
actcgatccc ctgctcgggc tcgacctcca atctccgagg gtcgtgcggc cccggatgcc 120 
cgggccccga gcggtgccca cggcctggcc cctgcg atg cgc cag gcc ggc gga 174 

Met Arg Gin Ala Gly Gly 
1 5 

ttg ggg ctg ctg gca eta etc ctg ctg gcg ctg ctg ggc ccc ggc ggc 222 
Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu Leu Gly Pro Gly Gly 
10 15 20 

cga ggg gtg ggc egg ccg ggc age ggg gca cag gcg ggg gcg ggg cgc 270 
Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin Ala Gly Ala Gly Arg 
25 30 35 

tgg gcc caa cgc ttc aag gtg gtc ttt gcg cct gtg ate tge aag egg 318 
Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro Vai lie Cys Lys Arg 
40 45 50 

acc tgt ctg aag ggc cag tgt egg gae age tgt cag cag ggc tec aac 366 
Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys Gin Gin Gly Ser Asn 
55 60 65 70 

atg acg etc ate gga gag aac ggc cac age acc gac acg etc acc ggt 4 14 

Met Thr Leu lie Gly Glu Asn Gly His Ser Thr Asp Thr Leu Thr Gly 
75 80 85 

tct gcc ttc cgc gtg gtg gtg tge ect eta ccc tgc atg aac ggt ggc 462 
Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro Cys Met Asn Gly Gly 
90 95 100 

cag tgc tct tec cga aac cag tgc ctg tgt ccc ccg gat ttc acg ggg 510 
Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro Pro Asp Phe Thr Gly 
105 110 115 

cgc ttc tgc cag gtg cct get gca gga acc gga get ggc acc ggg agt 558 
Arg Phe Cys Gin Val Pro Ala Ala Gly Thr Gly Ala Gly Thr Gly Ser 
120 125 130 

tea gge ccc ggc tgg ccc gac egg gcc atg tec aca ggc ccg ctg ccg 606 
Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser Thr Gly Pro Leu Pro 
135 140 145 150 

ccc ett gcc cca gaa gga gag tct gtg get age aaa cac gcc att tae 654 
Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser Lys His Ala lie Tyr 
155 160 165 

gcg gtg cag gtg ate gca gat cct ccc ggg ccg ggg gag ggt cct cct 702 
Ala Val Gin Val lie Ala Asp Pro Pro Gly Pro Gly Glu Gly Pro Pro 
170 175 180 
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gca caa cat gca gcc ttc ttg gtg ccc ctg ggg cca gga caa ate teg 750 
Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly Pro Gly Gin He Ser 
185 190 195 

gca gaa gtg cag get ecg ccc ccc gtg gtg aac gtg cgt gtc cat cac 798 
Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn Val Arg Val His His 
200 205 210 

cct cct gaa get tec gtt cag gtg cac cgc ate gag ggg ccg aac get 846 
Pro Pro Glu Ala Ser Val Gin Val His Arg He Glu Gly Pro Asn Ala 
215 220 225 230 

gaa ggc cca gcc tet tee cag eac ttg ctg ccg cat ccc aag ccc ccg 894 
Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro His Pro Lys Pro Pro 
235 240 245 

cac ccg agg cca ccc act caa aag cca ctg ggc cgc tgc ttc cag gae 942 
His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly Arg Cys Phe Gin Asp 
250 255 260 

aca ttg ccc aag cag cct tgt ggc age aac cct ttg cct ggc ctt ace 990 
Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro Leu Pro Gly Leu Thr 
265 270 275 

aag cag gaa gat tgc tgc ggt age ate ggt act gee tgg gga caa age 1038 
Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr Ala Trp Gly Gin Ser 
280 285 290 

aag tgt cac aag tgc cca cag ctt cag tat aca ggg gtg cag aag cct 1086 
Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr Gly Val Gin Lys Pro 
295 300 305 310 

gta cct gta cgt ggg gag gtg ggt get gac tgc ccc cag ggc tac aag 1134 
Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys Pro Gin Gly Tyr Lys 
315 320 325 

agg etc aac age acc cac tgc cag gat ate aac gaa tgt gcg atg ccc 1182 
Arg Leu Asn Ser Thr His Cys Gin Asp He Asn Glu Cys Ala Met Pro 
330 335 340 

ggg aat gtg tgc cat ggt gac tgc etc aac aac cct ggc tet tat cgc 1230 
Gly Asn Val Cys His Gly Asp Cys Leu Asn Asn Pro Gly Ser Tyr Arg 
345 ~ 350 355 

tgt gtc tgc ccg ccc ggt cat age ttg ggt ccc etc gca gca cag tgc 1278 
Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro Leu Ala Ala Gin Cys 
360 365 370 

att gcc gac aaa cca gag gag aag age ctg tgt ttc cgc ctt gtg age 1326 
He Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys Phe Arg Leu Val Ser 
375 380 385 390 

ace gaa cac cag tgc cag cac cct ctg acc aca cgc eta acc cgc cag 1374 
Thr Glu His Gin Cys Gin His Pro Leu Thr Thr Arg Leu Thr Arg Gin 
395 400 405 
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etc tgc tgc tgt agt gtg ggt aaa gcc tgg ggt gcc egg tge eag cgc 1422 
Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly Ala Arg Cys Gin Arg 
410 415 420 

tge ccg gca gat ggt aca gca gcc ttc aag gag ate tge ccc gge tgg 1470 
Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu lie Cys Pro Gly Trp 
425 430 435 

gaa agg gta cca tat cct cac etc cca cca gac get eae cat cea ggg 1518 
Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp Ala His His Pro Gly 
440 445 450 

gga aag cga ett etc cet ett cct gca eec gac ggg cca ccc aaa ccc 1566 
Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro Asp Gly Pro Pro Lys Pro 
455 460 465 470 

eag cag ett cct gaa age ccc age cga gca cca ccc etc gag gac aea 1614 
Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro Pro Leu Glu Asp Thr 
475 480 485 

gag gaa gag aga gga gtg aec atg gat cea cca gtg agt gag gag cga 1662 
Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro Val Ser Glu Glu Arg 
490 495 500 

teg gtg eag cag age cac ccc act ace aec ace tea ccc eec egg cet 1710 
Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr Ser Pro Pro Arg Pro 
505 510 515 

tae cea gag etc ate tct cgc ccc tec cca cct ace ttc cac egg ttc 1758 
Tyr Pro Glu Leu lie Ser Arg Pro Ser Pro Pro Thr Phe His Arg Phe 
520 525 530 

ctg cca gae ttg eee cca tec cga agt gea gtg gag ate gcc eee act 1806 
Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val Glu lie Ala Pro Thr 
535 540 545 550 

cag gtc aca gag ace gat gag tgc cga ttg aac cag aat ate tgt ggc 1854 
Gin Val Thr Glu Thr Asp Glu Cys Arg Leu Asn Gin Asn lie Cys Gly 
555 560 565 

cat gga cag tgt gtg cct ggc eee teg gat tac tee tge eae tge aac 1902 
His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr Ser Cys His Cys Asn 
570 575 580 

get ggc tac egg tea cac ccg eag eae egc tae tgt gtt gat gtg aac 1950 
Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr Cys Val Asp Val Asn 
585 590 595 

gag tgc gag gca gag cce tgc ggc ccc ggg aaa ggc ate tgt atg aac 1998 
Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys Gly lie Cys Met Asn 
600 605 610 

act ggt ggc tec tac aat tgt cac tgc aac cga ggc tac cgc etc cac 2046 
Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg Gly Tyr Arg Leu His 
615 620 625 630 
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9*^9 99t gca ggg ggc cgc teg tgc gtg gac ctg aac gag tgc gcc aag 2094 
Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu Asn Glu Cys Ala Lys 
635 640 645 

cct cac ctg tgt ggg gac ggt ggc ttc tgc ate aac ttc cct ggt cac 2142 
Pro His Leu Cys Gly Asp Gly Gly Phe Cys lie Asn Phe Pro Gly His 
650 655 660 

tac aaa tgc aac tgc tat cct ggc tac egg etc aag gcc tee cga ccg 2190 
Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu Lys Ala Ser Arg Pro 

665 670 675 

cec att tge gaa gac ate gac gag tgt cgc gac cct age acc tgc cct 2238 
Pro lie Cys Glu Asp He Asp Glu Cys Arg Asp Pro Ser Thr Cys Pro 
680 685 690 

gat ggc aaa tgt gaa aac aaa cct ggc age tte aag tge ate gee tgc 2286 
Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe Lys Cys lie Ala Cys 

695 700 705 710 

cag eet ggc tac egt age eag ggg ggc ggg gee tgt cgt gat gtc aac 2334 
Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala Cys Arg Asp Val Asn 
715 720 725 

gaa tgc tee gag ggt ace cec tgc tct cct gga tgg tgt gag aac ctt 2382 
Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly Trp Cys Glu Asn Leu 
730 735 740 

ccg ggt tct tac cgt tgc aeg tgt gcc cag ggg ata cga acc ege aca 2430 
Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly He Arg Thr Arg Thr 
745 750 755 

gga cgc etc agt tgc ata gac gtg gat gag tgt gag get ggg aaa gtg 2478 
Gly Arg Leu Ser Cys He Asp Val Asp Glu Cys Glu Ala Gly Lys Val 
760 765 770 

tge caa gat ggc ate tge aeg aac aca cca ggc tct tte cag tgt eag 2526 
Cys Gin Asp Gly He Cys Thr Asn Thr Pro Gly Ser Phe Gin Cys Gin 
775 780 785 790 

tge etc tec gge tat eat ctg tea agg gat egg age cgc tgt gag gae 2574 
Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg Ser Arg Cys Glu Asp 
795 800 805 

att gat gaa tgt gac ttc cct gcg gcc tgc ate ggg ggt gac tgc ate 2622 
He Asp Glu Cys Asp Phe Pro Ala Ala Cys He Gly Gly Asp Cys He 
810 815 820 

aat acc aat ggt tec tac aga tgt etc tgt cec ctg ggt cat egg ttg 2670 
Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro Leu Gly His Arg Leu 
825 830 835 

gtg ggc ggc agg aag tgc aag aaa gat ata gat gag tgc age eag gac 2718 
Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp Glu Cys Ser Gin Asp 
840 845 850 
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cca ggc ctg tgc ctg ccc cat gcc tgc gag aac etc cag ggc tec tat 2766 
Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn Leu Gin Gly Ser Tyr 
855 860 865 870 



gtc tgt gtc tgt gat gag ggt ttc aca etc acc cag gac cag cat ggg 
Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr Gin Asp Gin His Gly 
875 880 885 



2814 



tgt gag gag gtg gag cag ccc cac cac aag aag gag tgc tac ctt aac 
Cys Glu Glu Val Glu Gin Pro His His Lys Lys Glu Cys Tyr Leu Asn 
890 895 900 



2862 



ttc gat gac aca gtg ttc tgt gac age gta ttg get ace aat gtc act 
Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu Ala Thr Asn Val Thr 
905 910 915 



>910 



cag cag gaa tgc tgt tgc tct ctg gga get ggc tgg gga gac cac tgc 
Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly Trp Gly Asp His Cys 
920 925 930 



1958 



gaa ate tat ece tgt cca gtc tac age tea gee gaa ttt cae age ctg 
Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala Glu Phe His Ser Leu 
935 940 945 950 



3006 



gtg cct gat ggg aaa agg eta cac tea gga caa caa eat tgt gaa eta 
Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin Gin His Cys Glu Leu 

955 960 965 



3 0 54 



tgc att cct gee cac cgt gac ate gac gaa tge ata ttg ttt ggg gca 
Cys He Pro Ala His Arg Asp He Asp Glu Cys He Leu Phe Gly Ala 
970 975 980 



3io: 



gag ate tgc aag gag ggc aag tgt gtg aac aeg cag ece ggc tac gag 
Glu He Cys Lys Glu Gly Lys Cys Val Asn Thr Gin Pro Gly Tyr Glu 
985 990 995 



3150 



tgc tac tge aag cag ggc ttc tac tac gat ggc aac ctg ctg gag tge 
Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly Asn Leu Leu Glu Cys 
1000 1005 1010 



3198 



gtg gac gtg gat gag tge ttg gat gag tct aac tgc agg aac gga gtg 
Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn Cys Arg Asn Gly Val 
1015 1020 1025 1030 



3246 



tgt gag aac aca cgt ggc ggc tac cgc tgt gee tgc act ceg ecg gea 
Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys Ala Cys Thr Pro Pro Ala 
1035 1040 1045 



3294 



gag tac agt ccc gca cag gcc cag tgt ctg ate ccg gag aga tgg age 
Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu He Pro Glu Arg Trp Ser 
1050 1055 1060 



3342 



aeg ece cag aga gac gtg aag tgt get ggg gcc age gag gag agg aeg 
Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser Glu Glu Arg Thr 
1065 1070 1075 



3390 
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gca tgt gta tgg ggc ccc tgg gcg gga cct gcc etc act ttt gat gac 
Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu Thr Phe Asp Asp 
1080 1085 1090 



3438 



tgc tgc tgc cgc cag ccg egg ctg ggt acc cag tgc aga ccg tgc ccg 
Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys Arg Pro Cys Pro 
1095 1100 1105 1110 



3486 



cca cgt ggc acc ggg tec cag tgc ccg act tea cag agt gag age aat 
Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin Ser Glu Ser Asn 
1115 1120 1125 



3534 



tct ttc tgg gac aca age ccc ctg eta ctg ggg aag tet ccg cga gac 
Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys Ser Pro Arg Asp 
1130 1135 1140 



3582 



gaa gac age tea gag gag gat tea gat gag tgc cgt tgt gtg age gga 
Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg Cys Val Ser Gly 
1145 1150 1155 



3630 



cgc tgt gtg cca egg cca ggc ggg gcg gta tgc gag tgt ect gga ggc 
Arg Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu Cys Pro Gly Gly 
1160 1165 1170 



3678 



ttt cag ctg gac gee tee cgt gee cgc tgc gtg gac att gat gag tgc 
Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp lie Asp Glu Cys 
1175 1180 1185 1190 



3726 



cga gaa ctg aac cag egg gga ctg ctg tgt aag age gag egg tgc gtg 
Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser Glu Arg Cys Val 
1195 1200 1205 



3774 



aac acc agt gga tec ttc cgc tgt gtc tgc aaa get ggc tte aeg cgc 
Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala Gly Phe Thr Arg 
1210 1215 1220 



3822 



age cgc cct cac ggg cct gcg tgc etc age gcc gee get gat gat gca 
Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala Ala Asp Asp Ala 
1225 1230 1235 



3370 



gee ata gee cac ace tea gtg ate gat eat cga ggg tat ttt cac 
Ala He Ala His Thr Ser Val He Asp His Arg Gly Tyr Phe His 
1240 1245 1250 



3915 



tgaaagtgga 
ggatcettca 
cagagtgctg 
cctttaattt 
caactgettg 
etctgattat 
taaataeatc 



gacagacaag 
gggeccacaa 
tctgctttct 
act ct tggct 
gcgggttgga 
gtacactaga 
cgccgggaaa 



tacatccttt 
atctecttcc 
cccaagggtg 
ttcaaggcaa 
ctgagctggg 
tgtgeetgac 
aaaaaaaaaa 



gctcctgacc 
cacaccccaa 
attcctagaa 
attgatattc 
acccaggatg 
ctgctgacea 
aaaaaaaaaa 



aaacgagagc 
acccaaggtg 
aettcgaeat 
acatccaaag 
tgaaatagaa 
ggctcacatg 



atggacccaa 
ctcctgtctg 
cagatetgee 
cgggcagcat 
tttattgtgg 
gtttgtacaa 



3975 
4035 
4095 
4155 
4215 
4275 
4316 



<210> 11 

<211> 1253 

<212> PRT 

<213> Mus musculus 
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<400> 11 



Met 


Arg 


Gin 


Ala 


Gly 


Gly 


Leu 


Gly 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala 


1 








5 










10 










15 




Leu 


Leu 


Gly 


Pro 


Gly 


Gly 


Arg 


Gly 


Val 


Gly 


Arg 


Pro 


Gly 


Ser 


Gly 


Ala 








20 










25 










30 






Gin 


Ala 


Gly 


Ala 


Gly 


Arg 


Trp 


Ala 


Gin 


Arg 


Phe 


Lys 


Val 


Val 


Phe 


Ala 






35 










40 










45 








Pro 


Val 


He 


Cys 


Lys 


Arg 


Thr 


Cys 


Leu 


Lys 


Gly 


Gin 


Cys 


Arg 


Asp 


Ser 




50 










55 










60 










Cys 


Gin 


Gin 


Gly 


Ser 


Asn 


Met 


Thr 


Leu 


He 


Gly 


Glu 


Asn 


Gly 


His 


Ser 


65 










70 










75 










80 


Thr 


Asp 


Thr 


Leu 


Thr 


Gly 


Ser 


Ala 


Phe 


Arg 


Val 


Val 


Val 


Cys 


Pro 


Leu 










85 










90 










95 




Pro 


Cys 


Met 


Asn 


Gly 


Gly 


Gin 


Cys 


Ser 


Ser 


Arg 


Asn 


Gin 


Cys 


Leu 


Cys 








100 










105 










110 






Pro 


Pro 


Asp 


Phe 


Thr 


Gly 


Arg 


Phe 


Cys 


Gin 


Val 


Pro 


Ala 


Ala 


Gly 


Thr 






115 










120 










125 








Gly Ala 


Gly 


Thr 


Gly 


Ser 


Ser 


Gly 


Pro 


Gly 


Trp 


Pro 


Asp 


Arg 


Ala 


Met 




130 










135 










140 










Ser 


Thr 


Gly 


Pro 


Leu 


Pro 


Pro 


Leu 


Ala 


Pro 


Glu 


Gly 


Glu 


Ser 


Val 


Ala 


145 










150 










155 










160 


Ser 


Lys 


His 


Ala 


He 


Tyr 


Ala 


Val 


Gin 


Val 


He 


Ala 


Asp 


Pro 


Pro 


Gly 










165 










170 










175 




Pro 


Gly 


Glu 


Gly 


Pro 


Pro 


Ala 


Gin 


His 


Ala 


Ala 


Phe 


Leu 


Val 


Pro 


Leu 








180 










185 










190 






Gly 


Pro 


Gly 


Gin 


He 


Ser 


Ala 


Glu 


Val 


Gin 


Ala 


Pro 


Pro 


Pro 


Val 


Val 






195 










200 










205 








Asn 


Val 


Arg 


Val 


His 


His 


Pro 


Pro 


Glu 


Ala 


Ser 


Val 


Gin 


Val 


His 


Arg 




210 










215 










220 










He 


Glu 


Gly 


Pro 


Asn 


Ala 


Glu 


Gly 


Pro 


Ala 


Ser 


Ser 


Gin 


His 


Leu 


Leu 


225 










230 










235 










240 


Pro 


His 


Pro 


Lys 


Pro 


Pro 


His 


Pro 


Arg 


Pro 


Pro 


Thr 


Gin 


Lys 


Pro 


Leu 










245 










250 










255 




Gly 


Arg 


Cys 


Phe 


Gin 


Asp 


Thr 


Leu 


Pro 


Lys 


Gin 


Pro 


Cys 


Gly 


Ser 


Asn 








260 










265 










270 






Pro 


Leu 


Pro 


Gly 


Leu 


Thr 


Lys 


Gin 


Glu 


Asp 


Cys 


Cys 


Gly 


Ser 


He 


Gly 






275 










280 










285 








Thr 


Ala 


Trp 


Gly 


Gin 


Ser 


Lys 


Cys 


His 


Lys 


Cys 


Pro 


Gin 


Leu 


Gin 


Tyr 




290 










295 










300 










Thr 


Gly 


Val 


Gin 


Lys 


Pro 


val 


Pro 


Val 


Arg 


Gly 


Glu 


Val 


Gly 


Ala 


Asp 


305 










310 










315 










320 


Cys 


Pro 


Gin 


Gly 


Tyr 


Lys 


Arg 


Leu 


Asn 


Ser 


Thr 


His 


Cys 


Gin 


Asp 


He 










325 










330 










335 




Asn 


Glu 


Cys 


Ala 


Met 


Pro 


Gly 


Asn 


Val 


Cys 


His 


Gly 


Asp 


Cys 


Leu 


Asn 








340 










345 










350 






Asn 


Pro 


Gly 


Ser 


Tyr 


Arg 


Cys 


Val 


Cys 


Pro 


Pro 


Gly 


His 


Ser 


Leu 


Gly 






355 










360 










365 








Pro 


Leu 


Ala 


Ala 


Gin 


Cys 


lie 


Ala 


Asp 


Lys 


Pro 


Glu 


Glu 


Lys 


Ser 


Leu 




370 










375 










380 










Cys 


Phe 


Arg 


Leu 


Val 


Ser 


Thr 


Glu 


His 


Gin 


Cys 


Gin 


His 


Pro 


Leu 


Thr 


385 










390 










395 










400 


Thr 


Arg 


Leu 


Thr 


Arg 


Gin 


Leu 


Cys 


Cys 


Cys 


Ser 


Val 


Gly 


Lys 


Ala 


Trp 










405 










410 










415 




Gly 


Ala 


Arg 


Cys 


Gin 


Arg 


Cys 


Pro 


Ala 


Asp 


Gly 


Thr 


Ala 


Ala 


Phe 


Lys 








420 










425 










430 






Glu 


He 


Cys 


Pro 


Gly 


Trp 


Glu 


Arg 


Val 


Pro 


Tyr 


Pro 


His 


Leu 


Pro 


Pro 






435 










440 










445 
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Asp 


Ala 


His 


His 


Pro 


Gly 


Gly 


Lys 


Arg 


Leu 


Leu 


Pro 


Leu 


Pro 


Ala 


Pro 




4 5 0 










455 










4 6 0 










Asp 


Gly 


Pro 


Pro 


Lys 


Pro 


Gin 


Gin 


Leu 


Pro 


Glu 


Ser 


Pro 


Ser 


Arg 


Ala 


4 65 










470 










4 7 5 










4 8 0 


Pro 


Pro 


Leu 


Glu 


Asp 


Thr 


Glu 


Glu 


Glu 


Arg 


Gly 


Val 


Thr 


Met 


Asp 


Pro 










4 85 










4 90 










4 95 




Pro 


Val 


Ser 


Glu 


Glu 


Arg 


Ser 


Val 


Gin 


Gin 


Ser 


His 


Pro 


Thr 


Thr 


Thr 








50 0 










505 










510 






Thr 


Ser 


Pro 


Pro 


Arg 


Pro 


Tyr 


Pro 


Glu 


Leu 


He 


Ser 


Arg 


Pro 


Ser 


Pro 






515 










5 2 0 










52 5 








Pro 


Thr 


Phe 


His 


Arg 


Phe 


Leu 


Pro 


Asp 


Leu 


Pro 


Pro 


Ser 


Arg 


Ser 


Ala 




53 0 










53 5 










54 0 










Val 


Glu 


He 


Ala 


Pro 


Thr 


Gin 


Val 


Thr 


Glu 


Thr 


Asp 


Glu 


Cys 


Arg 


Leu 


54 5 










55 0 










555 










56 0 


Asn 


Gin 


Asn 


He 


Cys 


Gly 


His 


Gly 


Gin 


Cys 


Val 


Pro 


Gly 


Pro 


Ser 


Asp 










5 65 










570 










5 75 




Tyr 


Ser 


Cys 


His 


Cys 


Asn 


Ala 


Gly 


Tyr 


Arg 


Ser 


His 


Pro 


Gin 


His 


Arg 








5 80 










585 










590 






Tyr 


Cys 


Val 


Asp 


Val 


Asn 


Glu 


Cys 


Glu 


Ala 


Glu 


Pro 


Cys 


Gly 


Pro 


Gly 






5 95 










6 00 










60 5 








Lys 


Gly 


He 


Cys 


Met 


Asn 


Thr 


Gly 


Gly 


Ser 


Tyr 


Asn 


Cys 


His 


Cys 


Asn 




610 










615 










620 










Arg 


Gly 


Tyr 


Arg 


Leu 


His 


Val 


Gly 


Ala 


Gly 


Gly 


Arg 


Ser 


Cys 


Val 


Asp 


6 2 5 










6 3 0 










6 3 5 










64 0 


Leu 


Asn 


Glu 


Cys 


Ala 


Lys 


Pro 


His 


Leu 


Cys 


Gly 


Asp 


Gly 


Gly 


Phe 


Cys 










645 










65 0 










6 55 




lie 


Asn 


Phe 


Pro 


Gly 


His 


Tyr 


Lys 


Cys 


Asn 


Cys 


Tyr 


Pro 


Gly 


Tyr 


Arg 








660 










665 










6 7 0 






Leu 


Lys 


Ala 


Ser 


Arg 


Pro 


Pro 


He 


Cys 


Glu 


Asp 


He 


Asp 


Glu 


Cys 


Arg 






67 5 










6 80 










6 85 








Asp 


Pro 


Ser 


Thr 


Cys 


Pro 


Asp 


Gly 


Lys 


Cys 


Glu 


Asn 


Lys 


Pro 


Gly 


Ser 




690 










6 95 










70 0 










Phe 


Lys 


Cys 


He 


Ala 


Cys 


Gin 


Pro 


Gly 


Tyr 


Arg 


Ser 


Gin 


Gly 


Gly 


Gly 


705 










710 










715 










72 0 


Ala 


Cys 


Arg 


Asp 


Val 


Asn 


Glu 


Cys 


Ser 


Glu 


Gly 


Thr 


Pro 


Cys 


Ser 


Pro 










725 










730 










735 




Gly 


Trp 


Cys 


Glu 


Asn 


Leu 


Pro 


Gly 


Ser 


Tyr 


Arg 


Cys 


Thr 


Cys 


Ala 


Gin 








740 










74 5 










750 






Gly 


He 


Arg 


Thr 


Arg 


Thr 


Gly 


Arg 


Leu 


Ser 


Cys 


He 


Asp 


Val 


Asp 


Glu 






755 










760 










765 








Cys 


Glu 


Ala 


Gly 


Lys 


Val 


Cys 


Gin 


Asp 


Gly 


He 


Cys 


Thr 


Asn 


Thr 


Pro 




770 










775 










780 










Gly 


Ser 


Phe 


Gin 


Cys 


Gin 


Cys 


Leu 


Ser 


Gly 


Tyr 


His 


Leu 


Ser 


Arg 


Asp 


7 85 










790 










795 










80 0 


Arg 


Ser 


Arg 


Cys 


Glu 


Asp 


He 


Asp 


Glu 


Cys 


Asp 


Phe 


Pro 


Ala 


Ala 


Cys 










805 










810 










815 




lie 


Gly 


Gly 


Asp 


Cys 


He 


Asn 


Thr 


Asn 


Gly 


Ser 


Tyr 


Arg 


Cys 


Leu 


Cys 








820 










825 










830 






Pro 


Leu 


Gly 


His 


Arg 


Leu 


Val 


Gly 


Gly 


Arg 


Lys 


Cys 


Lys 


Lys 


Asp 


He 






83 5 










84 0 










845 








Asp 


Glu 


Cys 


Ser 


Gin 


Asp 


Pro 


Gly 


Leu 


Cys 


Leu 


Pro 


His 


Ala 


Cys 


Glu 




850 










855 










860 










Asn 


Leu 


Gin 


Gly 


Ser 


Tyr 


Val 


Cys 


Val 


Cys 


Asp 


Glu 


Gly 


Phe 


Thr 


Leu 


865 










870 










875 










880 


Thr 


Gin 


Asp 


Gin 


His 


Gly 


Cys 


Glu 


Glu 


Val 


Glu 


Gin 


Pro 


His 


His 


Lys 










885 










890 










895 




Lys 


Glu 


Cys 


Tyr 


Leu 


Asn 


Phe 


Asp 


Asp 


Thr 


Val 


Phe 


Cys 


Asp 


Ser 


Val 








900 










905 










910 
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Leu 


Ala 


Thr 

915 


Asn 


Val 


Thr 


Gin 


Gin 
920 


Glu 


Cys 


Cys 


Cys 


Ser 
925 


Leu 


Gly 


Ala 


Gly 


Trp 
930 


Gly 


Asp 


His 


Cys 


Glu 
93 5 


He 


Tyr 


Pro 


Cys 


Pro 
94 0 


Val 


Tyr 


Ser 


Ser 


Ala 


Glu 


Phe 


His 


Ser 


Leu 


Val 


Pro 


Asp 


Gly 


Lys 


Arg 


Leu 


His 


Se r 


Gly 


94 5 










9 5 0 










955 










9 6 0 


Gin 


Gin 


His 


Cys 


Glu 
965 


Leu 


Cys 


He 


Pro 


Ala 
97 0 


His 


Arg 


Asp 


He 


Asp 
97 5 


Glu 


Cys 


He 


Leu 


Phe 
980 


Gly 


Ala 


Glu 


He 


Cys 
985 


Lys 


Glu 


Gly 


Lys 


Cys 
990 


Val 


Asn 


Thr 


Gin 


Pro 


Gly 


Tyr 


Glu 


Cys 


Tyr 


Cys 


Lys 


Gin 


Gly 


Phe 


Tyr 


Tyr 


Asp 






995 










1000 








1005 






Gly 


Asn 


Leu 


Leu 


Glu 


Cys 


Val 


Asp 


Val 


Asp 


Glu 


Cys 


Leu 


Asp 


Glu 


Ser 




1010 








1015 








1020 








Asn 


Cys 


Arg 


Asn 


Gly 


Val 


Cys 


Glu 


Asn 


Thr 


Arg 


Gly Gly 


Tyr 


Arg 


Cys 


1025 








1030 








1035 








1040 


Ala 


Cys 


Thr 


Pro 


Pro 


Ala 


Glu 


Tyr 


Ser 


Pro 


Ala 


Gin 


Ala 


Gin 


Cys 


Leu 










1045 








1050 








1055 


lie 


Pro 


Glu 


Arg 


Trp 


Ser 


Thr 


Pro 


Gin 


Arg 


Asp 


Val 


Lys 


Cys 


Ala 


Gly 








1060 








1065 








1070 




Ala 


Ser 


Glu 


Glu 


Arg 


Thr 


Ala 


Cys 


Val 


Trp 


Gly 


Pro 


Trp 


Ala 


Gly 


Pro 






1075 








1080 








1085 






Ala 


Leu 


Thr 


Phe 


Asp 


Asp 


Cys 


Cys 


Cys 


Arg 


Gin 


Pro 


Arg 


Leu 


Gly 


Thr 




1090 








1095 








1100 








Gin 


Cys 


Arg 


Pro 


Cys 


Pro 


Pro 


Arg 


Gly 


Thr Gly 


Ser 


Gin 


Cys 


Pro 


Thr 


1105 








1110 








1115 








1120 


Ser 


Gin 


Ser 


Glu 


Ser 


Asn 


Ser 


Phe 


Trp 


Asp 


Thr 


Ser 


Pro 


Leu 


Leu 


Leu 










1125 








1130 








1135 


Gly 


Lys 


Ser 


Pro 


Arg 


Asp 


Glu 


Asp 


Ser 


Ser 


Glu 


Glu 


Asp 


Ser 


Asp 


Glu 








1140 








1145 








1150 




Cys 


Arg 


Cys 


Val 


Ser 


Gly Arg 


Cys 


Val 


Pro 


Arg 


Pro 


Gly 


Gly 


Ala 


Val 






1155 








1160 








1165 






Cys 


Glu 


Cys 


Pro 


Gly 


Gly 


Phe 


Gin 


Leu 


Asp 


Ala 


Ser 


Arg 


Ala 


Arg 


Cys 




1170 








1175 








1180 








Val 


Asp 


rle 


Asp 


Glu 


Cys 


Arg 


Glu 


Leu 


Asn 


Gin 


Arg 


Gly 


Leu 


Leu 


Cys 


1185 








1190 








1195 








1200 


Lys 


Ser 


Glu 


Arg 


Cys 


Val 


Asn 


Thr 


Ser 


Gly 


Ser 


Phe 


Arg 


Cys 


Val 


Cys 










1205 








1210 








1215 


Lys 


Ala 


Gly 


Phe 


Thr 


Arg 


Ser 


Arg 


Pro 


His 


Gly 


Pro 


Ala 


Cys 


Leu 


Ser 








1220 








1225 








1230 




Ala 


Ala 


Ala 


Asp 


Asp 


Ala 


Ala 


He 


Ala 


His 


Thr 


Ser 


Val 


He 


Asp 


His 



1235 1240 1245 



Arg Gly Tyr Phe His 
1250 



210> 12 

211> 3759 

212> DNA 

213> Mus musculus 



400> 12 

atgcgccagg ccggcggatt ggggctgctg gcactactcc tgctggcgct gctgggcccc 60 

ggcggccgag gggtgggccg gccgggcagc ggggcacagg cgggggcggg gcgctgggcc 12 0 

caacgcttca aggtggtctt tgcgcctgtg atctgcaagc ggacctgtct gaagggccag 180 

tgtcgggaca gctgtcagca gggctccaac atgacgctca tcggagagaa cggccacagc 240 

accgacacgc tcaccggttc tgccttccgc gtggtggtgt gccctctacc ctgcatgaac 300 

ggtggccagt gctcttcccg aaaccagtgc ctgtgtcccc cggatttcac ggggcgcttc 360 

tgccaggtgc ctgctgcagg aaccggagct ggcaccggga gttcaggccc cggctggccc 420 
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gaccgggcca tgtccacagg cccgctgccg ccccttgccc cagaaggaga gtctgtggct 480 

agcaaacacg ccatttacgc ggtgcaggtg atcgcagatc ctcccgggcc gggggagggt 54 0 

cctcctgcac aacatgcagc cttcttggtg cccctggggc caggacaaat ctcggcagaa 600 

gtgcaggctc cgccccccgt ggtgaacgtg cgtgtccatc accctcctga agcttccgtt 660 

caggcgcacc gcatcgaggg gccgaacgct gaaggcccag cctcttccca gcacttgctg 720 

ccgcatccca agcccccgca cccgaggcca cccactcaaa agccactggg ccgccgcttc 780 

caggacacat tgcccaagca gccttgtggc agcaaccctt tgcctggcct taccaagcag 340 

gaagattgct gcggtagcat cggtactgcc tggggacaaa gcaagtgtca caagtgccca 900 

cagcttcagt atacaggggt gcagaagcct gtacctgtac gtggggaggt gggtgctgac 960 

tgcccccagg gctacaagag gctcaacagc acccactgcc aggatatcaa cgaatgtgcg 1020 

atgcccggga atgtgtgcca tggtgactgc ctcaacaacc ctggctctta tcgctgtgtc 1080 

tgcccgcccg gtcatagctt gggtcccctc gcagcacagt gcattgccga caaaccagag 1140 

gagaagagcc tgtgtttccg ccttgtgagc accgaacacc agtgccagca ccctctgacc 1200 

acacgcctaa cccgccagct ctgctgctgt agtgtgggta aagcctgggg tgcccggtgc 1260 

cagcgctgcc cggcagatgg tacagcagcc ttcaaggaga tctgccccgg ctgggaaagg 1320 

gtaccatatc ctcacctccc accagacgct caccatccag ggggaaagcg acttctccct 1380 

cttcctgcac ccgacgggcc acccaaaccc cagcagcttc ctgaaagccc cagccgagca 1440 

ccacccctcg aggacacaga ggaagagaga ggagtgacca tggatccacc agtgagtgag 1500 

gagcgatcgg tgcagcagag ccaccccact accaccacct cacccccccg gccttaccca 1560 

gagctcatct ctcgcccctc cccacctacc ttccaccggt tcctgccaga cttgccccca 1620 

tcccgaagtg cagtggagat cgcccccact caggtcacag agaccgatga gtgccgattg 1680 

aaccagaata tctgtggcca tggacagtgt gtgcctggcc cctcggatta ctcctgccac 1740 

tgcaacgctg gctaccggtc acacccgcag caccgctact gtgttgatgt gaacgagtgc 1800 

gaggcagagc cctgcggccc cgggaaaggc atctgtatga acactggtgg ctcctacaat 1860 

tgtcactgca accgaggcta ccgcctccac gtgggtgcag ggggccgctc gtgcgtggac 1920 

ctgaacgagt gcgccaagcc tcacctgtgt ggggacggtg gcttctgcat caacttccct 1980 

ggtcactaca aatgcaactg ctatcctggc taccggctca aggcctcccg accgcccatt 2040 

tgcgaagaca tcgacgagtg tcgcgaccct agcacctgcc ctgatggcaa atgtgaaaac 2100 

aaacctggca gcttcaagtg catcgcctgc cagcctggct accgtagcca ggggggcggg 2160 

gcctgtcgtg atgtcaacga atgctccgag ggtaccccct gctctcctgg atggtgtgag 2220 

aaccttccgg gttcttaccg ttgcacgtgt gcccagggga tacgaacccg cacaggacgc 2280 

ctcagttgca tagacgtgga tgagtgtgag gctgggaaag tgtgccaaga tggcatctgc 2340 

acgaacacac caggctcttt ccagtgtcag tgcctctccg gctatcatct gtcaagggat 2400 

cggagccgct gtgaggacat tgatgaatgt gacttccctg cggcctgcat cgggggtgac 2460 

tgcatcaata ccaatggttc ctacagatgt ctctgtcccc tgggtcatcg gttggtgggc 2520 

ggcaggaagt gcaagaaaga tatagatgag tgcagccagg acccaggcct gtgcctgccc 2580 

catgcctgcg agaacctcca gggctcctat gtctgtgtct gtgatgaggg tttcacactc 2640 

acccaggacc agcatgggtg tgaggaggtg gagcagcccc accacaagaa ggagtgctac 2700 

cttaacttcg atgacacagt gttctgtgac agcgtattgg ctaccaatgt cactcagcag 2760 

gaatgctgtt gctctctggg agctggctgg ggagaccact gcgaaatcta tccctgtcca 2820 

gtctacagct cagccgaatt tcacagcctg gtgcctgatg ggaaaaggct acactcagga 2830 

caacaacatt gtgaactatg cattcctgcc caccgtgaca tcgacgaatg catattgttt 2940 

ggggcagaga tctgcaagga gggcaagtgt gtgaacacgc agcccggcta cgagtgctac 3000 

tgcaagcagg gcttctacta cgatggcaac ctgctggagt gcgtggacgt ggatgagtgc 3060 

ttggatgagt ctaactgcag gaacggagtg tgtgagaaca cacgtggcgg ctaccgctgt 3120 

gcctgcactc cgccggcaga gtacagtccc gcacaggccc agtgtctgat cccggagaga 3180 

tggagcacgc cccagagaga cgtgaagtgt gctggggcca gcgaggagag gacggcatgt 3240 

gtatggggcc cctgggcggg acctgccctc acttttgatg actgctgctg ccgccag^cg 3300 

cggctgggta cccagtgcag accgtgcccg ccacgtggca ccgggtccca gtgcccgact 3360 

tcacagagtg agagcaattc tttctgggac acaagccccc tgctactggg gaagtctccg 3420 

cgagacgaag acagctcaga ggaggattca gatgagtgcc gttgtgtgag cggacgctgt 3480 

gtgccacggc caggcggggc ggtatgcgag tgtcctggag gctttcagct ggacgcctcc 3540 

cgtgcccgct gcgtggacat tgatgagtgc cgagaactga accagcgggg actgctgtgt 3600 

aagagcgagc ggtgcgtgaa caccagtgga tccttccgct gtgtctgcaa agctggcttc 3660 

acgcgcagcc gccctcacgg gcctgcgtgc ctcagcgccg ccgctgatga tgcagccata 3720 

gcccacacct cagtgatcga tcatcgaggg tattttcac 3759 
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<210> 13 

<211> 1332 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (31) . . . (1113) 
<400> 13 

ccaagaattc ggcacgagga gaggccggcc atg gcc age ctg ggg ctg ctg etc 54 

Met Ala Ser Leu Gly Leu Leu Leu 
1 5 

ctg etc tta ctg aca gca ctg cca ccg ctg tgg tec tec tea ctg cct 102 
Leu Leu Leu Leu Thr Ala Leu Pro Pro Leu Trp Ser Ser Ser Leu Pro 
10 15 20 

999 ctg gae act get gaa agt aaa gee ace att gca gac ctg ate ctg 150 
Gly Leu Asp Thr Ala Glu Ser Lys Ala Thr He Ala Asp Leu lie Leu 
25 30 35 40 

tct geg ctg gag aga gee acc gtc ttc eta gaa cag agg ctg eet gaa 198 
Ser Ala Leu Glu Arg Ala Thr Val Phe Leu Glu Gin Arg Leu Pro Glu 
45 50 55 

ate aac ctg gat ggc atg gtg ggg gtc ega gtg ctg gaa gag cag eta 246 
He Asn Leu Asp Gly Met Val Gly Val Arg Val Leu Glu Glu Gin Leu 
60 65 70 

aaa agt gtc egg gag aag tgg gee cag gag ccc ctg ctg caa ccg ctg 294 
Lys Ser Val Arg Glu Lys Trp Ala Gin Glu Pro Leu Leu Gin Pro Leu 
75 80 85 

age ctg cgc gtg ggg atg ctg ggg gag aag ctg gag get gee ate cag 342 
Ser Leu Arg Val Gly Met Leu Gly Glu Lys Leu Glu Ala Ala He Gin 
90 95 100 

aga tec etc cae tac etc aag ctg agt gat ece aag tac eta aga gag 390 
Arg Ser Leu His Tyr Leu Lys Leu Ser Asp Pro Lys Tyr Leu Arg Glu 
105 110 115 120 

ttc cag ctg ace etc cag ccc ggg ttt tgg aag etc cca cat gcc tgg 438 
Phe Gin Leu Thr Leu Gin Pro Gly Phe Trp Lys Leu Pro His Ala Trp 
125 130 135 

ate cac act gat gcc tec ttg gtg tac ccc acg ttc ggg ccc cag gac 486 
He His Thr Asp Ala Ser Leu Val Tyr Pro Thr Phe Gly Pro Gin Asp 
140 145 150 

tea ttc tea gag gag aga agt gae gtg tgc ctg gtg cag ctg ctg gga 534 
Ser Phe Ser Glu Glu Arg Ser Asp Val Cys Leu Val Gin Leu Leu Gly 
155 160 165 

acc ggg acg gac age age gag ece tgc ggc etc tea gac etc tgc agg 582 
Thr Gly Thr Asp Ser Ser Glu Pro Cys Gly Leu Ser Asp Leu Cys Arg 
170 175 180 
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age etc atg acc aag ccc ggc tgc tea gge tac tgc ctg tec cac caa 630 
Ser Leu Met Thr Lys Pro Gly Cys Ser Gly Tyr Cys Leu Ser His Gin 
185 190 195 200 

ctg etc tte ttc etc tgg gee aga atg agg ggg tgc aca cag gga cca 678 
Leu Leu Phe Phe Leu Trp Ala Arg Met Arg Gly Cys Thr Gin Gly Pro 
205 210 215 

etc eaa cag age cag gae tat ate aac etc ttc tgc gcc aac atg atg 726 
Leu Gin Gin Ser Gin Asp Tyr lie Asn Leu Phe Cys Ala Asn Met Met 
220 225 230 

gae ttg aac cgc aga get gag gee ate gga tac gcc tac cct acc egg 774 
Asp Leu Asn Arg Arg Ala Glu Ala lie Gly Tyr Ala Tyr Pro Thr Arg 
235 240 245 

gae ate ttc atg gaa aac ate atg ttc tgt gga atg gge ggc tte tec 822 
Asp lie Phe Met Glu Asn lie Met Phe Cys Gly Met Gly Gly Phe Ser 
250 255 260 

gae tte tac aag etc egg tgg ctg gag gee att etc age tgg cag aaa 870 
Asp Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu Ser Trp Gin Lys 
265 270 275 280 

cag cag gaa gga tgc ttc ggg gag cct gat get gaa gat gaa gaa tta 918 
Gin Gin Glu Gly Cys Phe Gly Glu Pro Asp Ala Glu Asp Glu Glu Leu 
285 290 295 

tet aaa get att caa tat cag cag eat t tt teg agg aga gtg aag agg 966 
Ser Lys Ala lie Gin Tyr Gin Gin His Phe Ser Arg Arg Val Lys Arg 
300 305 310 

ega gaa aaa caa ttt cca gat ggc tgc tec tee cac aac aca gcc aca 1014 
Arg Glu Lys Gin Phe Pro Asp Gly Cys Ser Ser His Asn Thr Ala Thr 
315 320 325 

gca gtg gca gcc etg ggt gge ttc eta tac ate ctg gea gaa tac ccc 1062 
Ala Val Ala Ala Leu Gly Gly Phe Leu Tyr lie Leu Ala Glu Tyr Pro 
330 335 340 

cca gca aac aga gag eea cac cca tec aca eeg cca cca cca age age 1110 
Pro Ala Asn Arg Glu Pro His Pro Ser Thr Pro Pro Pro Pro Ser Ser 
345 350 355 360 

ege tgagaeggac ggtteeatgc eagctgcctg gaggaggaae agaccecttt 1163 
Arg 

agtectcatc ccttagatcc tggagggcac ggatcacate ctgggaagaa ggcatctgga 1223 

ggataagcaa ageeaeeecg aeaeceaatc ttggaagece tgagtaggca gggccagggt 1283 

aggtgggggc cgggagggac ecaggtgtga acggatgaat aaagttcaa 13 32 

<210> 14 
<211> 361 
<212> PRT 

<213> Homo sapiens 
<400> 14 
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PCT/IIS00/181S4 



Met 


Ala 


Ser 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Thr 


Ala 


Leu 


Pro 


1 








5 










1 0 










15 




Pro 


Leu 


Trp 


Ser 


Ser 


Ser 


Leu 


Pro 


Gly 


Leu 


Asp 


Thr 


Ala 


Glu 


Ser 


Lys 








2 0 










2 5 










3 0 






Ala 


Thr 


lie 


Ala 


Asp 


Leu 


lie 


Leu 


Ser 


Ala 


Leu 


Glu 


Arg 


Ala 


Thr 


Val 






3 5 










4 0 










4 5 








Phe 


Leu 


Glu 


Gin 


Arg 


Leu 


Pro 


Glu 


He 


Asn 


Leu 


Asp 


Gly 


Met 


Val 


Gly 




50 










5 5 










60 










Val 


Arg 


Val 


Leu 


Glu 


Glu 


Gin 


Leu 


Lys 


Ser 


Val 


Arg 


Glu 


Lys 


Trp 


Ala 


65 










70 










75 










80 


Gin 


Glu 


Pro 


Leu 


Leu 


Gin 


Pro 


Leu 


Ser 


Leu 


Arg 


Val 


Gly 


Met 


Leu 


Gly 










85 










90 










95 




Glu 


Lys 


Leu 


Glu 


Ala 


Ala 


He 


Gin 


Arg 


Ser 


Leu 


His 


Tyr 


Leu 


Lys 


Leu 








100 










105 










110 






Ser 


Asp 


Pro 


Lys 


Tyr 


Leu 


Arg 


Glu 


Phe 


Gin 


Leu 


Thr 


Leu 


Gin 


Pro 


Gly 






115 










12 0 










12 5 








Phe 


Trp 


Lys 


Leu 


Pro 


His 


Ala 


Trp 


He 


His 


Thr 


Asp 


Ala 


Ser 


Leu 


Val 




13 0 










135 










14 0 










Tyr 


Pro 


Thr 


Phe 


Gly 


Pro 


Gin 


Asp 


Ser 


Phe 


Ser 


Glu 


Glu 


Arg 


Ser 


Asp 


145 










150 










155 










160 


Val 


Cys 


Leu 


Val 


Gin 


Leu 


Leu 


Gly 


Thr 


Gly 


Thr 


Asp 


Ser 


Ser 


Glu 


Pro 










165 










17 0 










175 




Cys 


Gly 


Leu 


Ser 


Asp 


Leu 


Cys 


Arg 


Ser 


Leu 


Met 


Thr 


Lys 


Pro 


Gly 


Cys 








18 0 










18 5 










190 






Ser 


Gly 


Tyr 


Cys 


Leu 


Ser 


His 


Gin 


Leu 


Leu 


Phe 


Phe 


Leu 


Trp 


Ala 


Arg 






195 










2 0 0 










205 








Met 


Arg 


Gly 


Cys 


Thr 


Gin Gly 


Pro 


Leu 


Gin 


Gin 


Ser 


Gin 


Asp 


Tyr 


He 




210 










215 










2 2 0 










Asn 


Leu 


Phe 


Cys 


Ala 


Asn 


Met 


Met 


Asp 


Leu 


Asn 


Arg 


Arg 


Ala 


Glu 


Al a 


22 5 










230 










235 










24 0 


lie 


Gly 


Tyr 


Ala 


Tyr 


Pro 


Thr 


Arg 


Asp 


He 


Phe 


Met 


Glu 


Asn 


He 


Met 










24 5 










2 5 0 










2 55 




Phe 


Cys 


Gly 


Met 


Gly 


Gly 


Phe 


Ser 


Asp 


Phe 


Tyr 


Lys 


Leu 


Arg 


Trp 


Leu 








260 










265 










270 






Glu 


Ala 


He 


Leu 


Ser 


Trp 


Gin 


Lys 


Gin 


Gin 


Glu 


Gly 


Cys 


Phe 


Gly 


Glu 






275 










280 










285 








Pro 


Asp 


Ala 


Glu 


Asp 


Glu 


Glu 


Leu 


Ser 


Lys 


Ala 


He 


Gin 


Tyr 


Gin 


Gin 




2 90 










295 










300 










His 


Phe 


Ser 


Arg 


Arg 


Val 


Lys 


Arg 


Arg 


Glu 


Lys 


Gin 


Phe 


Pro 


Asp 


Gly 


305 










310 










315 










320 


Cys 


Ser 


Ser 


His 


Asn 


Thr 


Ala 


Thr 


Ala 


Val 


Ala 


Ala 


Leu 


Gly 


Gly 


Phe 










325 










330 










335 




Leu 


Tyr 


He 


Leu 


Ala 


Glu 


Tyr 


Pro 


Pro 


Ala 


Asn 


Arg 


Glu 


Pro 


His 


Pro 








340 










345 










350 






Ser 


Thr 


Pro 


Pro 


Pro 


Pro 


Ser 


Ser 


Arg 

















355 360 



<210> 15 
<211> 1083 
<212> DNA 

<213> Homo sapiens 
<400> 15 

atggccagcc tggggctgct gctcctgctc ttactgacag cactgccacc gctgtggtcc 60 
tcctcactgc ctgggctgga cactgctgaa agtaaagcca ccattgcaga cctgatcctg 120 
tctgcgctgg agagagccac cgtcttccta gaacagaggc tgcctgaaat caacctggat 180 
ggcatggtgg gggtccgagt gctggaagag cagctaaaaa gtgtccggga gaagtgggcc 240 
caggagcccc tgctgcaacc gctgagcctg cgcgtgggga tgctggggga gaagctggag 300 
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gctgccatcc agagatccct ccactacctc aagctgagtg atcccaagta cctaagagag 360 

ttccagctga ccctccagcc cgggttttgg aagctcccac atgcctggat ccacactgat 420 

gcctccttgg tgtaccccac gttcgggccc caggactcat tctcagagga gagaagtgac 480 

gtgtgcctgg tgcagctgcc gggaaccggg acggacagca gcgagccctg cggcctctca 540 

gacctctgca ggagcctcat gaccaagccc ggctgctcag gctactgcct gtcccaccaa 600 

ctgctcttct tcctctgggc cagaatgagg gggtgcacac agggaccact ccaacagagc 660 

caggactata ccaacctctt ctgcgccaac atgatggact cgaaccgcag agctgaggcc 720 

atcggatacg cctaccctac ccgggacatc ttcatggaaa acatcatgtt ctgtggaatg 780 

ggcggcttct ccgacttcta caagctccgg tggctggagg ccattctcag ctggcagaaa 840 

cagcaggaag gatgcttcgg ggagcctgat gctgaagatg aagaattatc taaagctatt 900 

caatatcagc agcatttttc gaggagagtg aagaggcgag aaaaacaatt tccagatggc 960 

tgctcctccc acaacacagc cacagcagtg gcagccctgg gtggcttcct atacatcctg 1020 

gcagaatacc ccccagcaaa cagagagcca cacccatcca caccgccacc accaagcagc 1080 

cgc 1083 

<210> 16 
<211> 1400 
<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (41) . . . (1195) 
<400> 16 

gtcgacccac gcgtccgcat ccaccagcag aaatcctgtc atg gcg aga etc ggg 55 

Met Ala Arg Leu Gly 
1 5 

ctg ctt etc etc ctg ctg ctg gee ctg cca eca cac ttc tec tea gtg 103 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro His Phe Ser Ser Val 
10 15 20 

tea tgg cca gac act gea cag ggc acc atg gea aac ttg ate ctg act 151 
Ser Trp Pro Asp Thr Ala Gin Gly Thr Met Ala Asn Leu lie Leu Thr 
25 30 35 

gea tta gaa aaa gee acc ttg ttc ttg gag gac agg ctg ccc aca ate 199 
Ala Leu Glu Lys Ala Thr Leu Phe Leu Glu Asp Arg Leu Pro Thr lie 
40 45 50 

aac ctg gat ggt gtg gtg ggc ttc caa gtg ctg gaa gtg caa etc cga 247 
Asn Leu Asp Gly Val Val Gly Phe Gin Val Leu Glu Val Gin Leu Arg 
55 60 65 

gga gtt cag gaa aaa tgg get cac aag ccc ttg ctg cag cct etc age 295 
Gly Val Gin Glu Lys Trp Ala His Lys Pro Leu Leu Gin Pro Leu Ser 
70 75 80 85 

atg cgc get gga cag atg gcc aac aca ctg tet get etc etc caa aaa 343 
Met Arg Ala Gly Gin Met Ala Asn Thr Leu Ser Ala Leu Leu Gin Lys 
90 95 100 

tec ate ttc tac etc aag cag agt gac ccc acg tac eta aga gag ttc 391 
Ser lie Phe Tyr Leu Lys Gin Ser Asp Pro Thr Tyr Leu Arg Glu Phe 
105 110 115 
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cag cca age att cag cct ggg ttt tgg aag ttg ccc aat gac tgg aca 439 
Gin Pro Ser lie Gin Pro Gly Phe Trp Lys Leu Pro Asn Asp Trp Thr 
120 125 130 

cgc acc aat gcc tec eta gtc tac ccc tgg ctg gaa ccc ctg gac tct 467 
Arg Thr Asn Ala Ser Leu Val Tyr Pro Trp Leu Glu Pro Leu Asp Ser 
135 140 145 

ttc tea gag gaa age age gat gtg tgc etg gtg eaa eta eta gga aca 535 
Phe Ser Glu Glu Ser Ser Asp Val Cys Leu Val Gin Leu Leu Gly Thr 
150 155 160 165 

ggg aca gac age age eag cet tge agg etc tec aac ttc tgc aga acc 583 
Gly Thr Asp Ser Ser Gin Pro Cys Arg Leu Ser Asn Phe Cys Arg Thr 
170 175 180 

ett atg acc aag gcc ggc tgc tea ggc tac age etc tec eat cag etg 631 
Leu Met Thr Lys Ala Gly Cys Ser Gly Tyr Ser Leu Ser His Gin Leu 
185 190 195 

etc ttc ttc etc tgg gee aga atg eaa ggg tgc acg gag gga ctg ttc 679 
Leu Phe Phe Leu Trp Ala Arg Met Gin Gly Cys Thr Glu Gly Leu Phe 
200 205 210 

etc cag age eaa eae tae atg gac ate ttc tgt gcc aat atg atg gaa 727 
Leu Gin Ser Gin His Tyr Met Asp lie Phe Cys Ala Asn Met Met Glu 
215 220 225 

etg aac eac aga get gag gcc gtt gga tae get tac ccc ace caa gac 775 
Leu Asn His Arg Ala Glu Ala Val Gly Tyr Ala Tyr Pro Thr Gin Asp 
230 235 240 245 

etc ttc atg gaa aac att atg ttc tgt ggt atg get gge ttc tct gac 823 
Leu Phe Met Glu Asn lie Met Phe Cys Gly Met Ala Gly Phe Ser Asp 
250 255 260 

ttc tac aag ctg cgc tgg ctg gag gcc att etc age tgg eag aac ccc 871 
Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu Ser Trp Gin Asn Pro 
265 270 275 

cag gtg gga tgc ttc ggg agg cct gac aca aag ggt gaa cct tct gaa 919 
Gin Val Gly Cys Phe Gly Arg Pro Asp Thr Lys Gly Glu Pro Ser Glu 
280 285 290 

gtt cca cat cag cag ggc att ctg aga aga gtg ega agg egg gaa aaa 967 
Val Pro His Gin Gin Gly lie Leu Arg Arg Val Arg Arg Arg Glu Lys 
295 300 305 

ctg ttc gca gat ggc tgt teg tgc eac aac aca gee aca gca gtc gca 1015 
Leu Phe Ala Asp Gly Cys Ser Cys His Asn Thr Ala Thr Ala Val Ala 
310 315 320 325 

gee ctg ggt gge ttt etc tac ate ctg gca gaa tac eac cca gac aat 1063 
Ala Leu Gly Gly Phe Leu Tyr lie Leu Ala Glu Tyr His Pro Asp Asn 
330 335 340 
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gga gat gca cat cca gaa tac tac cca aac cat gga gat cca tac tea 1111 
Gly Asp Ala His Pro Glu Tyr Tyr Pro Asn His Gly Asp Pro Tyr Ser 
345 350 355 

tec tea cag tea eca gca age aac tac caa gat ggt get gee ggc ect 1159 
Ser Ser Gin Ser Pro Ala Ser Asn Tyr Gin Asp Gly Ala Ala Gly Pro 
360 365 370 

gac gtc cag agg act ggc agg ccc ctt agt gtt tct taagteetga 1205 
Asp Val Gin Arg Thr Gly Arg Pro Leu Ser Val Ser 
375 380 385 

gtcagaggtc acaggctgag gaggcaattg aggaaagtga ceagctatat ccccatcgcc 1265 

aettctgggt gtttaaaagt cttgggagag cagggccagg gaaagcaggg ttggagagtg 1325 

gggtggccca gatgtcagca gaataeataa agcaeagtca attggagctg aaaaaaaaaa 1385 

aaaaagggcg gccgc 1400 

:210> 17 
:211> 385 
:212> PRT 

:213> Mus musculus 
:400> 17 

Met Ala Arg Leu Gly Leu Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 

15 10 15 

His Phe Ser Ser Val Ser Trp Pro Asp Thr Ala Gin Gly Thr Met Ala 

20 25 30 

Asn Leu lie Leu Thr Ala Leu Glu Lys Ala Thr Leu Phe Leu Glu Asp 

35 40 45 

Arg Leu Pro Thr lie Asn Leu Asp Gly Val Val Gly Phe Gin Val Leu 

50 55 60 

Glu Val Gin Leu Arg Gly Val Gin Glu Lys Trp Ala His Lys Pro Leu 
65 70 75 80 

Leu Gin Pro Leu Ser Met Arg Ala Gly Gin Met Ala Asn Thr Leu Ser 

85 90 95 

Ala Leu Leu Gin Lys Ser lie Phe Tyr Leu Lys Gin Ser Asp Pro Thr 

100 105 110 

Tyr Leu Arg Glu Phe Gin Pro Ser lie Gin Pro Gly Phe Trp Lys Leu 

115 120 125 

Pro Asn Asp Trp Thr Arg Thr Asn Ala Ser Leu Val Tyr Pro Trp Leu 

130 135 140 

Glu Pro Leu Asp Ser Phe Ser Glu Glu Ser Ser Asp Val Cys Leu Val 
145 150 155 160 

Gin Leu Leu Gly Thr Gly Thr Asp Ser Ser Gin Pro Cys Arg Leu Ser 

165 170 175 

Asn Phe Cys Arg Thr Leu Met Thr Lys Ala Gly Cys Ser Gly Tyr Ser 

180 185 190 

Leu Ser His Gin Leu Leu Phe Phe Leu Trp Ala Arg Met Gin Gly Cys 

195 200 205 

Thr Glu Gly Leu Phe Leu Gin Ser Gin His Tyr Met Asp lie Phe Cys 

210 215 220 

Ala Asn Met Met Glu Leu Asn His Arg Ala Glu Ala Val Gly Tyr Ala 
225 230 235 240 

Tyr Pro Thr Gin Asp Leu Phe Met Glu Asn lie Met Phe Cys Gly Met 

245 250 255 

Ala Gly Phe Ser Asp Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu 
260 265 270 
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ir I ic 
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Arg 


Pro 
2 8 5 


Asp 


Thr 


Lys 


bi y 


Glu 
290 


Pro 


Ser 


Glu 


Val 


Pro 
295 


MIS 


oX n 


Pin 
o X 11 


o X y 


X X t: 

300 


Leu 


Arg 


Arg 


V ct X 


Arg 


Arg 
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Glu 


Lys 


Leu 
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Asp 
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Lys 


o e X 


P\/ c 

Lys 


Hie; 

n X £3 


As n 


Th -r 
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310 










315 










320 


AI a 


Thr 


Aia 


vai 


Ala 
325 


Ala 


Leu 


r''\ \r 

Liiy 


uxy 


fne 
330 


Leu 


Tyr 


X X e 


Leu 


AX a 
335 


Pin 
Lj X U 


Tyr 


His 


Pro 


Asp 


Asn Gly 


Asp 


Ala 


His 


Pro 


Glu 


Tyr 


Tyr 


Pro 


Asn 


His 








340 










345 










350 






Gly 


Asp 


Pro 

355 


Tyr 


Ser 


Ser 


Ser 


Gin 
360 


Ser 


Pro 


Ala 


Ser 


Asn 
365 


Tyr 


Gin 


Asp 


Gly 


Ala 
370 


Ala 


Gly 


Pro 


Asp 


Val 
375 


Gin 


Arg 


Thr 


Gly 


Arg 
3B0 


Pro 


Leu 


Ser 


Val 



Ser 
385 



<210> 18 

<211> 1155 

<212> DNA 

<213> Mus musculus 

<400> 18 

atggcgagac tcgggctgct tctccccctg ctgctggccc tgccaccaca cttctcctca 60 

gtgtcatggc cagacactgc acagggcacc atggcaaact tgatcctgac tgcattagaa 120 

aaagccacct tgttcttgga ggacaggctg cccacaatca acctggatgg tgtggtgggc 180 

ttccaagtgc tggaagtgca actccgagga gttcaggaaa aatgggctca caagcccttg 240 

ctgcagcctc tcagcatgcg cgctggacag atggccaaca cactgtctgc tctcctccaa 300 

aaatccatct tctacctcaa gcagagtgac cccacgtacc taagagagtt ccagccaagc 360 

attcagcctg ggttttggaa gttgcccaat gactggacac gcaccaatgc ctccctagtc 420 

tacccctggc tggaacccct ggactctttc tcagaggaaa gcagcgatgt gtgcctggtg 480 

caactactag gaacagggac agacagcagc cagccttgca ggctctccaa cttctgcaga 540 

acccttatga ccaaggccgg ctgctcaggc tacagcctct cccatcagct gctcttcttc 600 

ctctgggcca gaatgcaagg gtgcacggag ggactgttcc tccagagcca acactacatg 660 

gacatcttct gtgccaatat gatggaactg aaccacagag ctgaggccgt tggatacgct 720 

taccccaccc aagacctctt catggaaaac attatgttct gtggtatggc tggcttctct 780 

gacttctaca agctgcgctg gctggaggcc attctcagct ggcagaaccc ccaggtggga 840 

tgcttcggga ggcctgacac aaagggtgaa ccttctgaag ttccacatca gcagggcatt 900 

ctgagaagag tgcgaaggcg ggaaaaactg ttcgcagatg gctgttcgtg ccacaacaca 960 

gccacagcag tcgcagccct gggtggcttt ctctacatcc tggcagaata ccacccagac 1020 

aatggagatg cacatccaga atactaccca aaccatggag atccatactc atcctcacag 1080 

tcaccagcaa gcaactacca agatggtgct gccggccctg acgtccagag gactggcagg 1140 

ccccttagtg tttct 1155 



<210> 19 

<211> 1356 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (105) . . , (1148) 
<400> 19 
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gtcgacccac gcgtcccgca gcctgcagcc cgcagcccgc agcccggagc cagatcgcgg 60 
gctcagacca aacccgactc gaccgccgcc cccagccagg cgcc atg ctg ccg ctt 116 

Met Leu Pro Leu 
1 

ctg ctg ggc ctg ctg ggc cca gcg gcc tgc tgg gcc ctg ggc ccg acc 164 
Leu Leu Gly Leu Leu Gly Pro Ala Ala Cys Trp Ala Leu Gly Pro Thr 
5 10 15 20 

GCC ggc ccg gga tec tct gag ctg cgc teg gcc ttc teg gcg gca cgc 212 
Pro Gly Pro Gly Ser Ser Glu Leu Arg Ser Ala Phe Ser Ala Ala Arg 
25 30 35 

acc acc ccc ctg gag ggc acg teg gag atg gcg gtg acc ttc gac aag 260 
Thr Thr Pro Leu Glu Gly Thr Ser Glu Met Ala Val Thr Phe Asp Lys 
40 45 50 

gtg tac gtg aac ate ggg ggc gac ttc gat gtg gcc acc ggc cag ttt 308 
Val Tyr Val Asn lie Gly Gly Asp Phe Asp Val Ala Thr Gly Gin Phe 
55 60 65 

cgc tgc cgc gtg ccc ggc gee tac ttc ttc tec ttc acg get ggc aag 356 
Arg Cys Arg Val Pro Gly Ala Tyr Phe Phe Ser Phe Thr Ala Gly Lys 
70 75 80 

gcc ccg eac aag age ctg teg gtg atg ctg gtg cga aac cgc gac gag 404 
Ala Pro His Lys Ser Leu Ser Val Met Leu Val Arg Asn Arg Asp Glu 
85 90 95 100 

gtg eag gcg ctg gcc ttc gac gag cag egg egg cca ggc gcg egg cgc 452 
Val Gin Ala Leu Ala Phe Asp Glu Gin Arg Arg Pro Gly Ala Arg Arg 
105 110 115 

gca gcc age cag age gee atg ctg cag etc gac tac ggc gac aca gtg 500 
Ala Ala Ser Gin Ser Ala Met Leu Gin Leu Asp Tyr Gly Asp Thr Val 
120 125 130 

tgg ctg egg ctg cat ggc gcc ccg cag tac gcg eta ggc gcg ccc ggc 548 
Trp Leu Arg Leu His Gly Ala Pro Gin Tyr Ala Leu Gly Ala Pro Gly 
135 140 145 

gcc acc ttc age ggc tac eta gtc tac gcc gac gcc gac get gac gcg 596 
Ala Thr Phe Ser Gly Tyr Leu Val Tyr Ala Asp Ala Asp Ala Asp Ala 
150 155 160 

cet gcg cgc ggg ccg ccc gcg ccc ccc gag ccg cgc teg gcc ttc teg 644 
Pro Ala Arg Gly Pro Pro Ala Pro Pro Glu Pro Arg Ser Ala Phe Ser 
165 170 175 180 

gcg gcg cgc acg cgc age ttg gtg ggc teg gac get ggc ccc ggg ccg 692 
Ala Ala Arg Thr Arg Ser Leu Val Gly Ser Asp Ala Gly Pro Gly Pro 
185 190 195 

egg eac caa cca etc gcc ttc gac acc gag ttc gtc aac att ggc ggc 740 
Arg His Gin Pro Leu Ala Phe Asp Thr Glu Phe Val Asn lie Gly Gly 
200 205 210 
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gac ttc gac gcg gcg gcc ggc gtg ttc cgc tgc cgt ctg ccc ggc gcc 788 
Asp Phe Asp Ala Ala Ala Gly Val Phe Arg Cys Arg Leu Pro Gly Ala 
215 220 225 

tac etc ttc tec ttc acg ctg ggc aag ctg ccg cgt aag acg ctg teg 836 
Tyr Phe Phe Ser Phe Thr Leu Gly Lys Leu Pro Arg Lys Thr Leu Ser 
230 235 240 

gtt aag ctg atg aag aac cgc gac gag gtg eag gcc atg att tac gae 884 
Val Lys Leu Met Lys Asn Arg Asp Glu Val Gin Ala Met lie Tyr Asp 
245 250 255 260 

gac ggc gcg teg egg cgc cgc gag atg eag age cag age gtg atg ctg 932 
Asp Gly Ala Ser Arg Arg Arg Glu Met Gin Ser Gin Ser Val Met Leu 
265 270 275 

gcc ctg egg cgc ggc gac gee gte tgg ctg etc age cac gac eae gac 980 
Ala Leu Arg Arg Gly Asp Ala Val Trp Leu Leu Ser His Asp His Asp 
280 285 290 

ggc tac ggc gcc tac age aac cac gat etc cca act gac etc aaa acg 1028 
Gly Tyr Gly Ala Tyr Ser Asn His Asp Leu Pro Thr Asp Leu Lys Thr 
295 300 305 

gtt ttg ecg agt tgg gae gte cac tgc tgt caa gtc aac eag aga ttt 1076 
Val Leu Pro Ser Trp Asp Val His Cys Cys Gin Val Asn Gin Arg Phe 
310 315 320 

gaa ctg tgc att ggt gtg ate cct gag gaa agt cag cac tgg gat gac 1124 
Glu Leu Cys lie Gly Val lie Pro Glu Glu Ser Gin His Trp Asp Asp 
325 330 335 340 

gee ate agg atg gat aca gac etc taactcattg aageaggaca eetgeacaea 1178 
Ala lie Arg Met Asp Thr Asp Leu 
345 

tgaaagtgag gggagaggga acaaagagct actgagggaa cagctaactt eagctggagt 1238 

cacetggttt aatgctgaga gaaaagtcca agcttgggat ggaggaatct gtagtttett 1298 

tgaaaeaagt etgcccacte eeaeaaaaaa aaaaaaaaaa aaaaaaaagg gcggccge 1356 

<210> 20 
<211> 348 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Leu Pro Leu Leu Leu Gly Leu Leu Gly Pro Ala Ala Cys Trp Ala 

15 10 15 

Leu Gly Pro Thr Pro Gly Pro Gly Ser Ser Glu Leu Arg Ser Ala Phe 

20 25 30 

Ser Ala Ala Arg Thr Thr Pro Leu Glu Gly Thr Ser Glu Met Ala Val 

35 40 45 

Thr Phe Asp Lys Val Tyr Val Asn He Gly Gly Asp Phe Asp Val Ala 

50 55 60 

Thr Gly Gin Phe Arg Cys Arg Val Pro Gly Ala Tyr Phe Phe Ser Phe 
65 70 75 80 

Thr Ala Gly Lys Ala Pro His Lys Ser Leu Ser Val Met Leu Val Arg 
85 90 95 



35 



wo 01/00672 PCT/US0(I/18184 

Asn Arg Asp Glu Val Gin Ala Leu Ala Phe Asp Glu Gin Arg Arg Pro 

100 105 110 

Gly Ala Arg Arg Ala Ala Ser Gin Ser Ala Met Leu Gin Leu Asp Tyr 

115 120 125 

Gly Asp Thr Val Trp Leu Arg Leu His Gly Ala Pro Gin Tyr Ala Leu 

130 135 140 

Gly Ala Pro Gly Ala Thr Phe Ser Gly Tyr Leu Val Tyr Ala Asp Ala 
145 150 155 160 

Asp Ala Asp Ala Pro Ala Arg Gly Pro Pro Ala Pro Pro Glu Pro Arg 

165 170 175 

Ser Ala Phe Ser Ala Ala Arg Thr Arg Ser Leu Val Gly Ser Asp Ala 

180 185 190 

Gly Pro Gly Pro Arg His Gin Pro Leu Ala Phe Asp Thr Glu Phe Val 

195 200 205 

Asn lie Gly Gly Asp Phe Asp Ala Ala Ala Gly Val Phe Arg Cys Arg 

210 215 220 

Leu Pro Gly Ala Tyr Phe Phe Ser Phe Thr Leu Gly Lys Leu Pro Arg 
225 230 235 240 

Lys Thr Leu Ser Val Lys Leu Met Lys Asn Arg Asp Glu Val Gin Ala 

245 250 255 

Met lie Tyr Asp Asp Gly Ala Ser Arg Arg Arg Glu Met Gin Ser Gin 

260 265 270 

Ser Val Met Leu Ala Leu Arg Arg Gly Asp Ala Val Trp Leu Leu Ser 

275 280 285 

His Asp His Asp Gly Tyr Gly Ala Tyr Ser Asn His Asp Leu Pro Thr 

290 295 300 

Asp Leu Lys Thr Val Leu Pro Ser Trp Asp Val His Cys Cys Gin Val 
305 310 315 320 

Asn Gin Arg Phe Glu Leu Cys lie Gly Val lie Pro Glu Glu Ser Gin 

325 330 335 

His Trp Asp Asp Ala lie Arg Met Asp Thr Asp Leu 
340 345 



<210> 21 

<211> 1044 

<212> DNA 

<213> Homo sapiens 



<400> 21 

atgctgccgc ttctgctggg cctgctgggc ccagcggcct gctgggccct gggcccgacc 60 

cccggcccgg gatcctctga gctgcgctcg gccttctcgg cggcacgcac cacccccctg 120 

gagggcacgt cggagatggc ggtgaccttc gacaaggtgt acgtgaacat cgggggcgac 180 

ttcgatgtgg ccaccggcca gtttcgctgc cgcgtgcccg gcgcctactt cttctccttc 240 

acggctggca aggccccgca caagagcctg tcggtgatgc tggtgcgaaa ccgcgacgag 300 

gtgcaggcgc tggccttcga cgagcagcgg cggccaggcg cgcggcgcgc agccagccag 360 

agcgccatgc tgcagctcga ctacggcgac acagtgtggc tgcggctgca tggcgccccg 420 

cagtacgcgc taggcgcgcc cggcgccacc ttcagcggct acctagtcta cgccgacgcc 480 

gacgctgacg cgcctgcgcg cgggccgccc gcgccccccg agccgcgctc ggccttctcg 540 

gcggcgcgca cgcgcagctt ggtgggctcg gacgctggcc ccgggccgcg gcaccaacca 600 

ctcgccttcg acaccgagtt cgtcaacatt ggcggcgact tcgacgcggc ggccggcgtg 660 

ttccgctgcc gtctgcccgg cgcctacttc ttctccttca cgctgggcaa gctgccgcgt 720 

aagacgctgt cggttaagct gatgaagaac cgcgacgagg tgcaggccat gatttacgac 780 

gacggcgcgt cgcggcgccg cgagatgcag agccagagcg tgatgctggc cctgcggcgc 84 0 

ggcgacgccg tctggctgct cagccacgac cacgacggct acggcgccta cagcaaccac 900 

gatctcccaa ctgacctcaa aacggttttg ccgagttggg acgtccactg ctgtcaagtc 960 

aaccagagat ttgaactgtg cattggtgtg atccctgagg aaagtcagca ctgggatgac 1020 

gccatcagga tggatacaga cctc 1044 
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<210> 22 
<211> 1388 
<212> DNA 
<;213> Catarrhini 

<220> 
<221> CDS 

<222> (250) . . . (1236) 
<400> 22 

gtcgacccac gcgtccgggc tccctccctg gctgaccttc cctttccctc acgcctcccc 60 
acgcccggcc tggggcccca gcaccctgtc cgccgccgcc tcagagccgg gagaagcagc 120 
cggagccccc ggcgcccccg ccgcagcgcg ggcggtcagt gcgcagcccg gcgcccgcag 180 
cccgcagccc gcagccggat cgcgggctcg gaccgaaccc gaccggaccg ccgcccccag 240 
ccaggctcc atg ctg ccg ctt ctg eta ggc ctg ctg ggc cca gcg gcc tgc 291 
Met Leu Pro Leu Leu Leu Gly Leu Leu Gly Pro Ala Ala Cys 
15 10 



tgg gcc ctg ggc ccg gcc ccc ggc ccg ggc tec tea gag ctg cgc teg 339 
Trp Ala Leu Gly Pro Ala Pro Gly Pro Gly Ser Ser Glu Leu Arg Ser 
15 20 25 30 

gcc ttc teg gcg gca cgc acc act ccc ctg gag ggc gcg teg gag atg 387 
Ala Phe Ser Ala Ala Arg Thr Thr Pro Leu Glu Gly Ala Ser Glu Met 
35 40 45 

gcg gtg acc ttc gac aag gtg tac gtg aac ate ggg ggc gac ttc gac 435 
Ala Val Thr Phe Asp Lys Val Tyr Val Asn lie Gly Gly Asp Phe Asp 
50 55 60 

gcg gcc aea ggc caa ttc cgc tgc cgc gtg ccc ggc gcc tac ttc ttc 4 83 

Ala Ala Thr Gly Gin Phe Arg Cys Arg Val Pro Gly Ala Tyr Phe Phe 
65 70 75 

tec ttc aeg gtt ggc aag gcc ccg cac aag age ctg teg gtg atg ctg 531 
Ser Phe Thr Val Gly Lys Ala Pro His Lys Ser Leu Ser Val Met Leu 
80 85 90 

gtg egg aac cac gac gag gtg cag gcg ctg gcc ttc gac gag cag agg 579 
Val Arg Asn His Asp Glu Val Gin Ala Leu Ala Phe Asp Glu Gin Arg 
95 100 105 110 

egg ccc age gca egg cgc gcc gcc age cag age gcc atg ctg cag etc 627 
Arg Pro Ser Ala Arg Arg Ala Ala Ser Gin Ser Ala Met Leu Gin Leu 
115 120 125 

gac tac ggc gac aca gtt tgg ctg egg ctg cat ggc gcc ccg cag tac 675 
Asp Tyr Gly Asp Thr Val Trp Leu Arg Leu His Gly Ala Pro Gin Tyr 
130 135 140 

gcg ctg ggc gcg ccc ggc gcc acc ttc age ggc tac ctg gtc tac gcc 723 
Ala Leu Gly Ala Pro Gly Ala Thr Phe Ser Gly Tyr Leu Val Tyr Ala 
145 150 155 

gac gcc gac gcc gac gcg ect gcg cgc ggg ccg ccc gcg ccc ccc gag 771 
Asp Ala Asp Ala Asp Ala Pro Ala Arg Gly Pro Pro Ala Pro Pro Glu 
160 165 170 
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ccg cgc teg gca ttc teg gcg gcg cgc acg cgc age ctg gtg ggc teg 819 
Pro Arg Ser Ala Phe Ser Ala Ala Arg Thr Arg Ser Leu Val Gly Ser 
175 180 185 190 

gae gcg ggg tec ggg ceg egg cac egg ceg eta gee ttc gae aee gag 867 
Asp Ala Gly Ser Gly Pro Arg His Arg Pro Leu Ala Phe Asp Thr Glu 
195 200 205 

etc gte aac att ggc ggc gae ttc gae geg gcg gee gge gtg ttc cgc 915 
Leu Val Asn He Gly Gly Asp Phe Asp Ala Ala Ala Gly Val Phe Arg 
210 215 220 

tgc cgc ctg ccc ggc gcc tac ttc ttc tec ttc acg ctg ggc aag ctg 963 
Cys Arg Leu Pro Gly Ala Tyr Phe Phe Ser Phe Thr Leu Gly Lys Leu 
225 230 235 

ceg cgt aag acg ctg teg gtg aag ctg atg aag aac cgc gae gag gtg 1011 
Pro Arg Lys Thr Leu Ser Val Lys Leu Met Lys Asn Arg Asp Glu Val 
240 245 250 

cag gcc atg att tae gae gae ggc gcg teg egg cgc ege gag atg cag 1059 
Gin Ala Met He Tyr Asp Asp Gly Ala Ser Arg Arg Arg Glu Met Gin 
255 260 265 270 

age cag age gtg atg ctg gcc ctg egg cgc ggc gae gcc gtc tgg ctg 1107 
Ser Gin Ser Val Met Leu Ala Leu Arg Arg Gly Asp Ala Val Trp Leu 
275 280 285 

etc age cac gae cae gae ggc tac ggc gcc tae age aac cac gge aag 1155 
Leu Ser His Asp His Asp Gly Tyr Gly Ala Tyr Ser Asn His Gly Lys 
290 295 300 

tac ate ace ttc tee ggc ttc ttg gtg tac ccc gae etc gee eee gee 1203 
Tyr He Thr Phe Ser Gly Phe Leu Val Tyr Pro Asp Leu Ala Pro Ala 
305 310 315 

gcc ceg eca gge etc ggg gee ceg gaa ctg ctg tgagceccgg gccagagagg 1256 
Ala Pro Pro Gly Leu Gly Ala Pro Glu Leu Leu 
320 325 

tgcecgggag ggccagggge gtgcatgccg ggccaggecc geggeccgaa egeetegege 1316 
gagcgccacg geegeeggge gcgectggac tctgccaata aageagaaag egggectgcg 1376 
eattgcccgg gc 1388 

<210> 23 
<211> 329 
<212> PRT 
<213> Catarrhini 

<400> 23 

Met Leu Pro Leu Leu Leu Gly Leu Leu Gly Pro Ala Ala Cys Trp Ala 

15 10 15 

Leu Gly Pro Ala Pro Gly Pro Gly Ser Ser Glu Leu Arg Ser Ala Phe 

20 25 30 

Ser Ala Ala Arg Thr Thr Pro Leu Glu Gly Ala Ser Glu Met Ala Val 

35 40 45 

Thr Phe Asp Lys Val Tyr Val Asn He Gly Gly Asp Phe Asp Ala Ala 
50 55 60 
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Thr Gly 
65 

Thr Val 

Asn His 

Ser Ala 

Gly Asp 
130 
Gly Ala 
145 

Asp Ala 

Ser Ala 

Gly Ser 

Asn lie 
210 
Leu Pro 
225 

Lys Thr 

Met He 

Ser Val 

His Asp 
290 
Thr Phe 

305 

Pro Gly 



Gin 

Gly 

Asp 

Arg 
115 
Thr 

Pro 

Asp 

Phe 

Gly 
195 
Gly 

Gly 

Leu 

Tyr 

Met 
275 
His 

Ser 

Leu 



Phe 

Lys 

Glu 
100 
Arg 

Val 

Gly 

Ala 

Ser 
180 
Pro 

Gly 

Ala 

Ser 

Asp 
260 
Leu 

Asp 

Gly 
Gly 



Arg Cys 
70 

Ala Pro 
85 

Val Gin 

Ala Ala 

Trp Leu 

Ala Thr 
150 
Pro Ala 
165 

Ala Ala 

Arg His 

Asp Phe 

Tyr Phe 
230 
Val Lys 
245 

Asp Gly 

Ala Leu 

Gly Tyr 

Phe Leu 
310 
Ala Pro 
325 



Arg Val Pro Gly 
His Lys 
Ala Leu 



Ser Gin 
120 
Arg Leu 
135 

Phe Ser 

Arg Gly 

Arg Thr 

Arg Pro 
200 
Asp Ala 
215 

Phe Ser 

Leu Met 

Ala Ser 

Arg Arg 
280 
Gly Ala 
295 

Val Tyr 



Ser Leu 
90 

Ala Phe 
105 

Ser Ala 



His Gly 

Gly Tyr 

Pro Pro 
170 
Arg Ser 
185 

Leu Ala 

Ala Ala 

Phe Thr 

Lys Asn 
250 
Arg Arg 
265 

Gly Asp 



Tyr Ser 
Pro Asp 
Glu Leu Leu 



Ala Tyr 
75 

Ser Val 

Asp Glu 

Met Leu 

Ala Pro 
140 
Leu Val 
155 

Ala Pro 

Leu Val 

Phe Asp 

Gly Val 
220 
Leu Gly 
235 

Arg Asp 

Arg Glu 

Ala Val 

Asn His 
300 
Leu Ala 
315 



Phe Phe 

Met Leu 

Gin Arg 
110 
Gin Leu 
125 

Gin Tyr 

Tyr Ala 

Pro Glu 

Gly Ser 
190 
Thr Glu 
205 

Phe Arg 

Lys Leu 

Glu Val 

Met Gin 
270 
Trp Leu 
285 

Gly Lys 
Pro Ala 



Ser Phe 
80 

Val Arg 
95 

Arg Pro 

Asp Tyr 

Ala Leu 

Asp Ala 
160 
Pro Arg 
175 

Asp Ala 

Leu Val 

Cys Arg 

Pro Arg 
240 
Gin Ala 
255 

Ser Gin 

Leu Ser 

Tyr He 

Ala Pro 
320 



<210> 24 
<211> 987 
<212> DNA 
<213> Catarrhini 



<400> 24 

atgctgccgc ttctgctagg cctgctgggc ccagcggcct gctgggccct gggcccggcc 60 

cccggcccgg gctcctcaga gctgcgctcg gccttctcgg cggcacgcac cactcccctg 120 

gagggcgcgt cggagatggc ggtgaccttc gacaaggtgt acgtgaacat cgggggcgac 180 

ttcgacgcgg ccacaggcca attccgctgc cgcgtgcccg gcgcctactt cttctccttc 240 

acggttggca aggccccgca caagagcctg tcggtgatgc tggtgcggaa ccacgacgag 300 

gtgcaggcgc tggccttcga cgagcagagg cggcccagcg cacggcgcgc cgccagccag 360 

agcgccatgc tgcagctcga ctacggcgac acagtttggc tgcggctgca tggcgccccg 420 

cagtacgcgc tgggcgcgcc cggcgccacc ttcagcggct acctggtcta cgccgacgcc 480 

gacgccgacg cgcctgcgcg cgggccgccc gcgccccccg agccgcgctc ggcattctcg 540 

gcggcgcgca cgcgcagcct ggtgggctcg gacgcggggt ccgggccgcg gcaccggccg 600 

ctagccttcg acaccgagct cgtcaacatt ggcggcgact tcgacgcggc ggccggcgtg 660 

ttccgctgcc gcctgcccgg cgcctacttc ttctccttca cgctgggcaa gctgccgcgt 720 

aagacgctgt cggtgaagct gatgaagaac cgcgacgagg tgcaggccat gatttacgac 780 

gacggcgcgt cgcggcgccg cgagatgcag agccagagcg tgatgctggc cctgcggcgc 840 

ggcgacgccg tctggctgct cagccacgac cacgacggct acggcgccta cagcaaccac 900 

ggcaagtaca tcaccttctc cggcttcttg gtgtaccccg acctcgcccc cgccgccccg 960 

ccaggcctcg gggccccgga actgctg 987 
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<210> 25 
<211> 625 
<212> DNA 

<213> Mas musculus 
<400> 25 

tcggaccgcc cgccaccagc cacgtgccat gctgctgctc ttgctgggct tcctaggccc 60 

ggcggcctgc tgggcactgg gcccggctgg ccctggctcc tcggagctgc ggtcagcctt 120 

ctcggcggct cgcaccaccc cgctggaggg cacgtcggag atggcggtga ccttcgacaa 180 

ggtgtacgtg aacatcgggg gtgacttcga cgcagccacc gggcggttcc gctgtcgcgt 240 

gccgggcgcc tacttcttct ccttcacggc cggcaaggcc ccgcacaaaa acctgtcggt 300 

gatgctggtg cgcaaccgcg acgaagtgca ggcgctggct ttcgacaagc agcgacggcc 360 

aggcgcgcgg cgcgcggcca gccaaagcgc catgctgcag ctcgactacg gcgacacggt 420 

gtggctgcgg ctgcacggcg ctccgcatta cgcgctcggc gcgccgggcg ccaccttcag 480 

cggctacctg gtgtacgcgg acgccgacgc cgacgcgcct gcgcgcgggc ccgcggcccc 540 

ggagccgcgc tcggccttct ccgcgcgcca cgccacctgg tgggctccga acccgccccg 600 

gcccgcgcca cggcgtttgg ccttc 625 



<210> 26 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Met Ala Glu Leu Pro Gly Pro Phe Leu Cys Gly Ala Leu Leu Gly Phe 

15 10 15 

Leu Cys Leu Ser Val Pro Pro Ser Asn Pro 
20 25 

<210> 27 
<211> 136 
<212> PRT 

<2 13 > Homo sapiens 
<400> 27 

Leu Cys Ser Gin Ser Gly Gin Thr Ser Val Gly Gly Ser Thr Ala Leu 

15 10 15 

Arg Cys Ser Ser Ser Glu Gly Ala Pro Lys Pro Val Tyr Asn Trp Val 

20 25 30 

Arg Leu Gly Thr Phe Pro Thr Pro Ser Pro Gly Ser Met Val Gin Asp 

35 40 45 

Glu Val Ser Gly Gin Leu lie Leu Thr Asn Leu Ser Leu Thr Ser Ser 

50 55 60 

Gly Thr Tyr Arg Cys Val Ala Thr Asn Gin Met Gly Ser Ala Ser Cys 
65 70 75 80 

Glu Leu Thr Leu Ser Val Thr Glu Pro Ser Gin Gly Arg Val Ala Gly 

85 90 95 

Ala Leu lie Gly Val Leu Leu Gly Val Leu Leu Leu Ser Val Ala Ala 

100 105 110 

Phe Cys Leu Val Arg Phe Gin Lys Glu Arg Gly Lys Lys Pro Lys Glu 

115 120 125 

Thr Tyr Gly Gly Ser Asp Leu Arg 
130 135 

<210> 28 
<211> 61 



40 



wo (»1/(M»672 




<212> PRT 

<213> Homo sapiens 



<400 


> 28 
















Gly 


Gly 


Ser 


Thr 


Ala 


Leu 


Arg 


Cys 


Ser 


Ser 


1 








5 










10 


Pro 


Val 


Tyr 


Asn 
20 


Trp 


Val 


Arg 


Leu 


Gly 
25 


Thr 


Gly 


Ser 


Met 
35 


Val 


Gin 


Asp 


Glu 


Val 
40 


Ser 


Gly 


Leu 


Ser 
50 


Leu 


Thr 


Ser 


Ser 


Gly 
55 


Thr 


Tyr 


Arg 



<210> 29 

<211> 78 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetically generated peptide 



<400> 29 



Gly Glu 


Ser 


Val 


Thr 


Leu 


Thr 


Cys 


Ser 


Val 


1 






5 










10 


Gly Val 


Ser 


Val 


Thr 


Trp 


Tyr 


Phe 


Lys 


Asn 






20 










25 




lie Leu 


Gly 


T>^r 


Ser 


Tyr 


Ser 


Arg 


He 


Glu 




35 










40 






Leu Ser 


Glu 


Gly 


Arg 


Phe 


Ser 


He 


Ser 


Ser 


50 










55 








Ser Val 


Glu 


Lys 


Glu 


Asp 


Ser 


Gly 


Thr 


Tyr 


65 








70 










<210> 30 
















<211> 189 
















<212> PRT 
















<213> Homo sapiens 












<400> 30 
















Arg Asn 


Ser 


Gin 


Leu 


Arg 


He 


Val 


Leu 


Val 


1 






5 










10 


Lys Ser 


Ala 


Thr 


Gly 


Asn 


Ser 


He 


Leu 


Gly 






20 










25 




Gly Thr 


Ala 


Ala 


Lys 


Ser 


He 


Thr 


Lys 


Lys 




35 










40 






Ser Trp 


Lys 


Glu 


Thr 


Glu 


Leu 


Val 


Val 


Val 


50 










55 








Asp Thr 


Glu 


Val 


Pro 


Asn 


Ala 


Glu 


Thr 


Ser 


65 








70 










He Leu 


Leu 


Thr 


Ser 


Pro 


Gly 


Pro 


His 


Ala 








85 










90 


Leu Gly 


Arg 


Tyr 


Thr 


Glu 


Glu 


Glu 


His 


Lys 






100 










105 




Lys Met 


Phe 


Gly 


Glu 


Arg 


Ala 


Arg 


Ser 


Phe 




115 










120 






Arg Lys 


Asp 


Asp 


Leu 


Gly 


Asp 


Thr 


Asn 


Leu 


130 










135 









41 
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Ser Glu Gly Ala Pro Lys 
15 

Phe Pro Thr Pro Ser Pro 
30 

Gin Leu He Leu Thr Asn 
45 

Cys Val Ala 
60 



Ser Gly Phe Gly Pro Pro 

15 

Gly Lys Leu Gly Pro Ser 
30 

Ser Gly Glu Lys Ala Asn 
45 

He Thr Leu Thr He Ser 
60 

Thr Cys Val Val 
75 



Gly Lys Thr Gly Ala Gly 
15 

Arg Lys Val Phe His Ser 
30 

Cys Glu Lys Arg Ser Ser 
45 

Asp Thr Pro Gly He Phe 
60 

Lys Glu He He Arg Cys 
75 80 
Leu Leu Leu Val Val Pro 
95 

Ala Thr Glu Lys He Leu 
110 

Met He Leu He Phe Thr 
125 

His Asp Tyr Leu Arg Glu 
140 



wo 01/00672 




PCT/USOO/18184 



Ala 


Pro 


Glu 


Asp 


He 


Gin 


Asp 


Leu 


Met 


Asp 


He 


Phe 


Gly 


Asp 


Arg 


Tyr 


145 










150 










155 










160 


Cys 


Ala 


Leu 


Asn 


Asn 
165 


Lys 


Ala 


Thr 


Gly 


Ala 
170 


Glu 


Gin 


Glu 


Ala 


Gin 
175 


Arg 


Ala 


Gin 


Leu 


Leu 


Gly 


Leu 


He 


Gin 


Arg 


Val 


Val 


Arg 


Glu 









180 185 

<210> 31 
<211> 292 
<212> PRT 

<213> Artificial sequence 
< :: 2 0 > 

<::23> Synthetically generated peptide 



<4C0> 31 



Lys 


Gly 


Phe 


Asp 


Phe 


Thr 


Leu 


Met 


Val 


Va 1 


Gly 


Glu 


Ser 


Gly 


Leu 


Gly 


1 








5 










1 0 










15 




Lys 


Thr 


Thr 


Leu 


He 


Asn 


Thr 


Leu 


Phe 


Leu 


Thr 


Asp 


Leu 


He 


Asp 


Ala 








20 










25 










30 






Asn 


Gly 


Val 


Ala 


Asn 


Asp 


Ser 


Arg 


Glu 


He 


Asp 


Gly 


Ala 


Ser 


Glu 


Thr 






3 5 










4 0 










4 5 








Lys 


He 


Lys 


Lys 


Thr 


Val 


Glu 


He 


Lys 


Glu 


He 


Thr 


Lys 


Val 


Glu 


He 




50 










55 










60 










Glu 


Glu 


Asp 


Gly 


Val 


Lys 


Leu 


Asn 


Leu 


Thr 


Val 


He 


Asp 


Thr 


Pro 


Gly 


65 










70 










75 










80 


Phe 


Gly 


Asp 


Ala 


He 


Asp 


Asn 


Ser 


Lys 


Cys 


Trp 


Glu 


Pro 


lie 


Val 


Glu 










85 










90 










95 




Tyr 


He 


Asp 


Glu 


Gin 


His 


Glu 


Gin 


Tyr 


Leu 


Arg 


Gin 


Glu 


Ser 


Arg 


He 








ICQ 










105 










110 






Asn 


Arg 


Thr 


Lys 


He 


Val 


Asp 


Asn 


Arg 


Val 


His 


Cys 


Cys 


Leu 


Tyr 


Phe 






115 










120 










125 








He 


Ser 


Pro 


Thr 


Gly 


His 


Gly 


Leu 


Lys 


Pro 


Leu 


Asp 


Val 


Glu 


Phe 


Met 




130 










135 










140 










Lys 


Lys 


Leu 


Ser 


Glu 


Lys 


Val 


Asn 


Leu 


He 


Pro 


Val 


He 


Ala 


Lys 


Ala 


145 










150 










155 










160 


Asp 


Thr 


Leu 


Thr 


Ala 


Asp 


Glu 


Leu 


Gin 


Glu 


Phe 


Lys 


Lys 


Arg 


He 


Arg 










165 










170 










175 




Glu 


Glu 


He 


Glu 


Arg 


Gin 


Asn 


He 


Lys 


He 


Tyr 


Lys 


Phe 


Pro 


Asp 


Glu 








180 










185 










190 






Glu 


Glu 


Asp 


Glu 


Gly Asp 


Glu 


Glu 


Phe 


Lys 


Glu 


Gin 


Thr 


Gin 


Gin 


Leu 






195 










200 










205 








Lys 


Ser 


Ser 


He 


Pro 


Phe 


Ala 


He 


Val 


Gly 


Ser 


Asn 


Glu 


Glu 


He 


Glu 




210 










215 










220 










Asn 


Gly 


Asp 


Gly 


Glu 


Lys 


Val 


Arg 


Gly Arg 


Lys 


Tyr 


Pro 


Trp 


Gly 


Val 


225 










230 










235 










240 


Val 


Glu 


Val 


Glu 


Asn 


Pro 


Ser 


His 


Cys 


Asp 


Phe 


Val 


Lys 


Leu 


Arg 


Asn 










245 










250 










255 




Leu 


Leu 


He 


Arg 


Thr 


His 


Leu 


Gin 


Asp 


Leu 


Lys 


Glu 


Thr 


Thr 


Glu 


Glu 








260 










265 










270 






He 


Leu 


Tyr 


Glu 


Asn 


Tyr 


Arg 


Ser 


Glu 


Lys 


Leu 


Ser 


Ala 


Leu 


Gly 


Leu 






275 










280 










285 








Lys 


Ala 


Glu 


Asn 



























290 



<210> 32 
<211> 163 
<212> PRT 



42 
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<213> Homo sapiens 
<400> 32 

Gin Leu Arg He Val Leu Val Gly Lys Thr Gly Ala Gly Lys Ser Ala 

15 10 15 

Thr Gly Asn Ser He Leu Gly Arg Lys Val Phe His Ser Gin Thr Ala 

20 25 30 

Ala Lys Ser He Thr Lys Lys Cys Glu Lys Arg Ser Ser Ser Trp Lys 

35 40 45 

Glu Thr Glu Leu Val Val Val Asp Thr Pro Gly He Phe Asp Thr Glu 

50 55 60 

Val Pro Asn Ala Glu Thr Ser Lys Glu He He Arg Gys He Leu Leu 
6S 70 75 80 

Thr Ser Pro Gly Pro His Ala Leu Leu Leu Val Val Pro Leu Gly Arg 

85 90 95 

Tyr Thr Glu Glu Glu His Lys Ala Thr Glu Lys He Leu Lys Met Phe 

100 105 110 

Gly Glu Arg Ala Arg Ser Phe Met He Leu He Phe Thr Arg Lys Asp 

115 120 125 

Asp Leu Gly Asp Thr Asn Leu His Asp Tyr Leu Arg Glu Ala Pro Glu 

130 135 140 

Asp He Gin Asp Leu Met Asp He Phe Gly Asp Arg Tyr Cys Ala Leu 
145 150 155 160 

Asn Asn Lys 



<210> 33 
<211> 198 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetically generated peptide 
<400> 33 

Gly Glu Val Leu Ala Leu Val Gly Pro Asn Gly Ala Gly Lys Ser Thr 

15 10 15 

Leu Leu Lys Leu He Ser Gly Leu Leu Pro Pro Thr Glu Gly Thr He 

20 25 30 

Leu Leu Asp Gly Ala Arg Asp Leu Arg Leu Ser Lys Leu Lys Glu Arg 

35 40 45 

Leu Glu Arg Leu Arg Lys Asn He Gly Val Val Phe Gin Asp Pro Thr 

50 55 60 

Leu Phe Pro Asn Val Glu Leu Thr Val Arg Glu Asn He Ala Phe Gly 
65 70 75 80 

Leu Arg Leu Ser Leu Gly Leu Ser Lys Asp Glu Gin Arg Ala Arg Leu 

85 90 95 

Lys Lys Ala Gly Ala Glu Glu Leu Leu Glu Arg Leu Gly Leu Gly Tyr 

100 105 110 

Asp His Leu Leu Asp Arg Arg Pro Gly Thr Leu Ser Gly Gly Gin Lys 

115 120 125 

Gin Arg Val Ala He Ala Arg Ala Leu Leu Thr Lys Pro Lys Leu Leu 

130 135 140 

Leu Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro Ala Ser Arg Ala Gin 
145 150 155 160 

Leu Leu Glu Leu Leu Arg Glu Leu Arg Gin Gin Gly Gly Thr Val Leu 
165 170 175 



43 
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Leu He Thr His Asp Leu Asp Leu Leu Asp Arg Leu Ala Asp Arg He 

180 185 190 

Leu Val Leu Glu Asp Gly 
195 

<210> 34 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Cys Pro Leu Pro Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin 

15 10 15 

Cys Leu Cys Pro Pro Asp Phe Thr Gly Arg Phe Cys 
20 25 

<210> 35 
<2i:> 36 
<212> PRT 

<213> Homo sapiens 
<400> 35 

Cys Ala Met Pro Gly Val Cys Arg His Gly Asp Cys Leu Asn Asn Pro 

15 10 15 

G":y Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro Ser 

20 25 30 

Arg Thr Gin Cys 
35 

<210> 36 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<400> 3 6 

Cys Arg Leu Asn Gin Asn He Cys Gly His Gly Glu Cys Val Pro Gly 

15 10 15 

Pro Pro Asp Tyr Ser Cys His Cys Asn Pro Gly Tyr Arg Ser His Pro 

20 25 30 

Gin His Arg Tyr Cys 
35 

<210> 37 
<211> 39 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Cys Glu Ala Glu Pro Cys Gly Pro Gly Arg Gly He Cys Met Asn Thr 

1 5 10 15 

Gly Gly Ser Tyr Asn Cys His Cys Asn Arg Gly Tyr Arg Leu His Val 

20 25 30 

Gly Ala Gly Gly Arg Ser Cys 
35 

<210> 38 
<211> 38 
<212> PRT 
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<213> Homo sapiens 
<400> 38 

Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys lie Asn Phe 

15 10 15 

Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu Lys Ala 

20 25 30 

Ser Arg Pro Pro Val Cys 
35 

<210> 39 
<211> 36 
<212> PRT 

<213> Homo sapiens 
<400> 39 

Cys Arg Asp Pro Ser Ser Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro 

1.5 10 15 

Gly Ser Phe Lys Cys lie Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly 

20 25 30 

Gly Gly Ala Cys 
35 

<210> 40 
<211> 36 
<212> PRT 

<213> Hono sapiens 
<400> 40 

Cys Ala Glu Gly Ser Pro Cys Ser Pro Gly Trp Cys Glu Asn Leu Pro 

1.5 10 15 

Gly Ser Phe Arg Cys Thr Cys Ala Gin Gly Tyr Ala Pro Ala Pro Asp 

20 25 30 

Gly Arg Ser Cys 
35 

<210> 41 
<211> 36 
<212> PRT 

<213> Homo sapiens 
<400> 41 

Cys Glu Ala Gly Asp Val Cys Asp Asn Gly lie Cys Ser Asn Thr Pro 

15 10 15 

Gly Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp 

20 25 30 

Arg Ser His Cys 
35 

<210> 42 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Cys Asp Phe Pro Ala Ala Cys He Gly Gly Asp Cys He Asn Thr Asn 
15 10 15 



45 
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Gly Ser Tyr Arg Cys Leu Cys Pro Gin Gly His Arg Leu Val Gly Gly 
20 25 30 

Arg Lys Cys 
35 

<210> 43 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<400> 43 

Cys Ser Gin Asp Pro Ser Leu Cys Leu Pro His Gly Ala Cys Lys Asn 

15 10 15 

Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Pro Thr 

2 0 2 5 3 0 

Gin Asp Gin His Gly Cys 
35 

<210> 44 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<400> 44 

Cys Met Leu Phe Gly Ser Glu He Cys Lys Glu Gly Lys Cys Val Asn 

15 10 15 

Thr Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp 

20 25 30 

Gly Asn Leu Leu Glu Cys 
35 

<210> 45 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Cys Leu Asp Glu Ser Asn Cys Arg Asn Gly Val Cys Glu Asn Thr Arg 

15 10 15 

Gly Gly Tyr Arg Cys Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro Ala 
20 25 30 

Gin Arg Gin 
35 

<210> 46 
<211> 36 
<212> PRT 

<213> Homo sapiens 
<400> 46 

Cys Gin Asp Pro Ala Ala Cys Arg Pro Gly Arg Cys Val Asn Leu Pro 

15 10 15 

Gly Ser Tyr Arg Cys Glu Cys Arg Pro Pro Trp Val Pro Gly Pro Ser 

20 25 30 

Gly Arg Asp Cys 
35 

<210> 47 



46 
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<211> 36 
<212> PRT 

<213> HoTLO sapiens 
<400> 47 

Asp Ser Asp Glu Cys Arg Cys Val Ser Gly Arg Cys Val Pro Arg Pro 

15 10 15 

Gly Gly Ala Ala Cys Glu Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser 
20 25 30 



wo 01/00672 PCT/US00/18I84 



<400> 51 

His Gin Cys Gin His Pro Leu Thr Thr Arg Leu Thr Arg Gin Leu Cys 

15 10 15 

Cys Cys Ser Val Gly Lys Ala Trp Gly Ala Arg Cys Gin Arg Cys Pro 

20 25 30 

Thr Asp Gly Thr Ala Ala Phe Lys Glu lie 
35 40 

<210> 52 
<211> 44 
<212> PRT 

<213> Homo sapiens 



<400> 52 

Val Phe Cys Asp Ser Val Leu Ala Thr Asn Val Thr Gin Gin Glu Cys 

15 10 15 

Cys Cys Ser Leu Gly Ala Gly Trp Gly Asp His Cys Glu lie Tyr Pro 

20 25 30 

Cys Pro Val Tyr Ser Ser Ala Glu Phe His Ser Leu 
35 40 



<210> 53 

<211> 46 

<212> PRT 

<213> Homo sapiens 



<400> 53 


























Gly Met Cys 


Ala 


Gly 


Pro 


Leu 


Ala 


Gly 


Pro Ala 


Leu 


Thr 


Phe 


Asp 


Asp 


1 




5 










10 








15 




Cys Cys Cys 


Arg 
20 


Gin 


Gly 


Arg 


Gly 


Trp 
25 


Gly Ala 


Gin 


Cys 


Arg 
30 


Pro 


Cys 


Pro Pro Arg 


Gly 


Ala 


Gly 


Ser 


His 


Cys 


Pro Thr 


Ser 


Gin 


Ser 






35 










40 








45 









<210> 54 

<211> 45 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetically generated peptide 



<400> 54 

Gly Arg Cys Ser Asn Pro Leu Pro 

1 5 
Cys Cys Cys Ser Val Gly Arg Gly 
20 

Leu Cys Pro Val Pro Gly Thr Ala 
35 40 



Gly Arg Ala Val Thr Lys Ser Glu 

10 15 
Glu Ala Trp Gly Thr Pro Cys Glu 

25 30 
Glu Phe Lys Glu Leu 
45 



<210> 55 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<400> 55 

Arg Asp Pro Ser Ser Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly 
15 10 15 



48 
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Ser Phe Lys Cys lie Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly 

20 25 30 

Gly Ala Cys Arg Asp Val Asn Glu Cys Ala Glu Gly Ser Pro Cys Ser 

35 40 45 

Pro Gly Trp Cys Glu Asn Leu Pro Gly Ser Phe Arg Cys Thr Cys Ala 
50 55 60 

Gin 
65 

<210> 56 
<211> 67 
<212> PRT 

<2 13 > Art i f icial sequence 



<220> 

< 2 2 3 > Synthetically generated peptide 
<400> 56 

Met Asp Pro Gin Asn Cys Ser Cys Ala Thr Gly Gly Ser Cys Thr Cys 

15 10 15 

Gly Thr Ser Cys Lys Cys Lys Asn Cys Lys Cys Thr Ser Cys Lys Lys 

20 25 30 

Ser Cys Cys Ser Cys Cys Pro Ala Gly Cys Ser Lys Cys Ala Gly Gly 

35 40 45 

Cys Val Cys Lys Gly Gly Gly Ala Ala Ser Glu Thr Ser Lys Cys Ser 

50 55 60 

Cys Cys Ala 
6 5 



<210> 57 
<211> 305 
<212> PRT 

< 2 1 3 > Homo sapiens 



<400> 57 



Met 


Ala 


Ala 


Gin 


Tyr 


Gly 


Ser 


Met 


Ser 


Phe 


Asn 


Pro 


Ser 


Thr 


Pro 


Gly 


1 








5 










10 










15 




Ala 


Ser 


Tyr 


Gly 
20 


Pro 


Gly 


Arg 


Gin 


Glu 
25 


Pro 


Arg 


Asn 


Ser 


Gin 
30 


Leu 


Arg 


I]e 


Val 


Leu 

35 


Val 


Gly 


Lys 


Thr 


Gly 
40 


Ala 


Gly 


Lys 


Ser 


Ala 
45 


Thr 


Gly 


Asn 


Ser 


lie 


Leu 


Gly 


Arg 


Lys 


Val 


Phe 


His 


Ser 


Gly Thr Ala 


Ala 


Lys 


Ser 




50 










55 










60 










He 


Thr 


Lys 


Lys 


Cys 


Glu 


Lys 


Arg 


Ser 


Ser 


Ser 


Trp 


Lys 


Glu 


Thr 


Glu 


65 










70 










75 










80 


Leu 


Val 


Val 


Val 


Asp 
85 


Thr 


Pro 


Gly 


He 


Phe 
90 


Asp 


Thr 


Glu 


Val 


Pro 
95 


Asn 


Ala 


Glu 


Thr 


Ser 

100 


Lys 


Glu 


He 


He 


Arg 
105 


Cys 


He 


Leu 


Leu 


Thr 
110 


Ser 


Pro 


Gly 


Pro 


His 


Ala 


Leu 


Leu 


Leu 


Val 


Val 


Pro 


Leu Gly 


Arg 


Tyr 


Thr 


Glu 






115 










120 










125 








Glu 


Glu 
130 


His 


Lys 


Ala 


Thr 


Glu 
135 


Lys 


He 


Leu 


Lys 


Met 
140 


Phe 


Gly 


Glu 


Arg 


Ala 


Arg 


Ser 


Phe 


Met 


He 


Leu 


He 


Phe 


Thr 


Arg 


Lys 


Asp 


Asp 


Leu 


Gly 


145 










150 










155 










160 


Asp 


Thr 


Asn 


Leu 


His 
165 


Asp 


Tyr 


Leu 


Arg 


Glu 
170 


Ala 


Pro 


Glu 


Asp 


He 
175 


Gin 
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Asp 


Leu 


Met 


Asp 
180 


lie 


Phe 


Gly 


Asp 


Arg 
185 


Tyr 


Cys 


Ala 


Leu 


Asn 
190 


Asn 


Lys 


Ala 


Thr 


Giy Ala 


Glu 


Gin 


Glu 


Al a 


Gin 


Arg 


Ala 


Gin 


Leu 


Leu 


Gly 


Leu 






19 5 










200 










205 








lie 


Gin 
210 


Arg 


Val 


Val 


Arg 


Glu 
215 


Asn 


Lys 


Glu 


Gly 


Cys 
220 


Tyr 


Thr 


Asn 


Arg 


Met 


Tyr 


Gin 


Arg 


Ala 


blU 


blU 


blU 


lie 


Gin 


Lys 


Gin 


Thr 


Gin 


Ala 


Met 


225 










230 










235 










240 


Gin 


Glu 


Leu 


His 


Arg 
245 


Val 


Glu 


Leu 


Glu 


Arg 
250 


Glu 


Lys 


TV 1 

Ala 


Arg 


lie 
255 


Arg 


Glu 


Glu 


Tyr 


Glu 
260 


Glu 


Lys 


lie 


Arg 


Lys 
265 


Leu 


Glu 


Asp 


Lys 


Val 
270 


Glu 


Gin 


Glu 


Lys 


Arg 
275 


Lys 


Lys 


Gin 


Met 


Glu 
280 


Lys 


Lys 


Leu 


Ala 


Glu 
285 


Gin 


Glu 


Ala 


His 


Tyr 
290 


Ala 


Val 


Arg 


Gin 


Gin 
295 


Arg 


Ala 


Arg 


Thr 


Glu 
300 


Val 


Glu 


Ser 


Lys 


Asp 
































305 

































<210> 58 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 58 

Gly lie Leu Glu Leu 
1 5 
Leu Leu 



lie Met Thr Ala Leu Gin He Ala Ser Phe He 
10 15 



<210> 59 
<211> 6 
<212> PRT 

<2 1 3 > Homo sapiens 
<400> 59 

Arg Leu Phe Ala Glu Asp 
1 5 

<210> 60 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 60 

Met Arg Gly Ala Gly Ala Ala Gly Leu Leu Ala Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Leu Leu Leu Gly Leu Gly Gly Arg Val Glu Gly 
20 25 

<210> 61 
<211> 1260 
<212> PRT 

<213> Homo sapiens 
<400> 61 

Gly Pro Ala Gly Glu Arg Gly Ala Gly Gly Gly Gly Ala Leu Ala Arg 
15 10 15 



50 
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Glu Arg Phe Lys Val Val Phe Ala Pro Val lie Cys Lys Arg Thr Cys 

20 25 30 

Leu Lys Gly Gin Cys Arg Asp Ser Cys Gin Gin Gly Ser Asn Met Thr 

35 40 45 

Leu He Gly Glu Asn Gly His Ser Thr Asp Thr Leu Thr Gly Ser Gly 

50 55 60 

Phe Arg Val Val Val Cys Pro Leu Pro Cys Met Asn Gly Gly Gin Cys 
65 70 75 80 

Ser Ser Arg Asn Gin Cys Leu Cys Pro Pro Asp Phe Thr Gly Arg Phe 

85 90 95 

Cys Gin Val Pro Gin Ala Gly Gly Ala Gly Gly Gly Thr Gly Gly Ser 

100 105 110 

Gly Pro Gly Leu Ser Arg Thr Gly Ala Leu Ser Thr Gly Ala Leu Pro 

115 120 125 

Pro Leu Ala Pro Glu Gly Asp Ser Val Ala Ser Lys His Ala He Tyr 

130 135 140 

Ala Val Gin Val He Ala Asp Pro Pro Gly Pro Gly Glu Gly Pro Pro 
145 150 155 160 

Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly Pro Gly Gin He Ser 

165 170 175 

Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn Val Arg Val His His 

180 185 190 

Pro Pro Glu Ala Ser Val Gin Val His Arg He Glu Ser Ser Asn Ala 

195 200 205 

Glu Ser Ala Ala Pro Ser Gin His Leu Leu Pro His Pro Lys Pro Ser 

210 215 220 

His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly Arg Cys Phe Gin Asp 
225 230 235 240 

Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro Leu Pro Gly Leu Thr 

245 250 255 

Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr Ala Trp Gly Gin Ser 

260 265 270 

Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr Gly Val Gin Lys Pro 
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Glu Glu Arg Gly Val Thr Thr Asp Ser Pro Val Ser Glu Glu Arg Ser 
465 470 475 480 



51 



wo 01/00672 




PCT/l]S0(»/18!«4 



V d. 1 


± 11 


n 


C a V- 

be r 


HI S 


Pro 


i nr 


HI a 


i iir 


i n r 


i nr 


Pro 


a 1 3 
Al a 


Arg 


Pro 


Tyr 










A ft 










*4 J? U 










A Q ^ 
'± y D 




Pro 




Leu 


lie 


Ser 


Arg 


Pro 


Ser 


Pro 


Pro 


i nr 


rie c 


Arg 


Trp 


f ne 


Leu 








n n 
D u u 










c; n R 

J U 3 










510 






Pro 


Asp 


Leu 


Pro 


Pro 


OCT 


Arg 




2X 1 3 
rt.X a. 


V d J. 


n 1 n 
w i. U 


1 X c 


A X d 


P ro 


1 ilX 


Pi n 






DID 










5 2 0 










52 5 








1 


i nr 


GIU 


i nr 


Asp 


J- U 


Cys 


Arg 


Leu 


Asn 


o in 


As n 


T 1 o 

X X e 


Cys 


P 1 -v/ 

o X y 


nx o 




D O U 










5 3 5 










54 0 










Gly 


Glu 


Cys 


va 1 


Pro 


tj±y 


Pro 


Pro 


Asp 


Tyr 


Ser 


Cys 


MIS 


Cys 


Asn 


Pro 


t /I c 

D 4 Id 










n 

D D u 










t; c; 

3 D D 










c; n 

D D u 


Gly 


Tyr 


Arg 


Ser 


His 


Pro 


Gin 


His 


Arg 


Tyr 


Cys 


val 


Asp 


1 7 1 

Val 


Asn 


Glu 










c c c 

_J D 3 










R 7 n 










c; T 




Cys 


Glu 


Ala 


Glu 


Pro 


Cys 


uiy 


Pro 


Gly 


Arg 


Lily 


i le 


Cys 


Met 


Asn 


Tnr 








J o u 










c; P 

O C3 D 










c; Q n 
J ^ u 






Gly 


Gly 


Ser 


Tyr 


Asn 


Cys 


His 


Cys 


Asn 


Arg 


Gly 


Tyr 


Arg 


Leu 


His 


^ 7 1 

V ai 






c: Q c 
D -7 D 










D U V 










n ^ 

DUD 








Gly 


Ala 


Gly 


Gly 


Arg 


Ser 


Cys 


V a 1 


Asp 


Leu 


As n 


Pin 

Ul u 


Cys 


A 1 = 

Ala 


Lys 


Pro 




610 










615 










6 2 0 










His 


Leu 


Cys 


Liiy 


Asp 


Giy 


uiy 


irne 


Cys 


lie 


Asn 


Phe 


Pro 


Gi y 


His 


Tyr 


f=; 9 ^ 
u 3 










T n 

D J U 










O J J 










A n 

D U 


Lys 


Cys 


Asn 


Cys 


Tyr 


Pro 


(jiy 


Tyr 


Arg 


Leu 


Lys 


Aia 


Ser 


Arg 


Pro 


Pro 










j1 c: 

b 1 D 










c c; n 
b 3> U 










c c c: 

ODD 




Vai 


Cys 


Glu 


Asp 


He 


Asp 


Glu 


Cys 


Arg 


Asp 


Pro 


Ser 


Ser 


Cys 


Pro 


Asp 








ez c r\ 
b D U 










S c; 

O O D 










c 7 n 
b / U 






Gly 


Lys 


Cys 


Glu 


Asn 


Lys 


Pro 


Gly 


Ser 


pne 


Lys 


Cys 


T 1 & 

lie 


A 1 = 

Ala 


Cys 


Gin 






7 c; 
O 1 D 










3 n 

D O U 










ft c: 

D O D 








Pro 


Gly 


Tyr 


Arg 


Ser 


Gin 


Liiy 


Liiy 


G ly 


Al a 


Cys 


Arg 


Asp 


va 1 


Asn 


Glu 




O J? u 










6 95 










7 0 0 










Cys 


Til =i 


o 1 U 


±y 


Ser 


Pro 


Cys 


Ser 


Pro 


± y 


Trp 


Cys 


Pin 
O 1 U 


Asn 


Leu 


Pro 


7 n R 










710 










715 










7 2 0 


Gly 


Ser 


Pne 


Arg 


Cys 


i nr 


Cys 


rt.1 a 


ui n 


ly 


Tyr 


Ala 


Pro 


a 1 :3 

Al a 


Pro 


Asp 










7 C 










7^0 










7 1 ^ 




Gly 


Arg 


Ser 


Cys 


Leu 


Asp 


Val 


Asp 


Glu 


Cys 


Glu 


Ala 


Gly 


Asp 


Va 1 


Cys 








74 0 










74 5 










7 5 0 






Asp 


Asn 


Gly 


He 


Cys 


Ser 


Asn 


Thr 


Pro 


Gly 


Ser 


Phe 


Gin 


Cys 


Gin 


Cys 






7 5 5 










1 a r\ 
/ b U 










/Ob 








Leu 


Ser 


Gly 


Tyr 


His 


Leu 


Ser 


Arg 


Asp 


Arg 


Ser 


His 


Cys 


Glu 


Asp 


He 




7 7 0 










inc. 

1 1 D 










Ton 










Asp 


Glu 


Cys 


Asp 


Phe 


Pro 


TV T 

Al a 


Ala 


Cys 


lie 


Gly 


Gly 


Asp 


Cys 


I le 


Asn 


"7 Q 










7 Q n 










7 Q R 
1 jD 










fl n n 
D u u 


Thr 


Asn 


Gly 


Ser 


Tyr 


Arg 


Cys 


Leu 


Cys 


Pro 


nl T^ 

Gin 


Gly 


His 


Arg 


Leu 


va 1 










8 05 










o i U 










O 1 D 




Gly 


Gly 


Arg 


Lys 


Cys 


Gin 


Asp 


Tic 

lie 


Asp 


Glu 


Cys 


Ser 


Gin 


Asp 


Pro 


Ser 








Q O A 
O Z U 










Q 7 t; 
0 z 3 










Q "J n 
O J U 






Leu 


Cys 


Leu 


Pro 


His 


Gly 


Ala 


Cys 


Lys 


Asn 


Leu 


oil- 
Gin 


Gly 


Ser 


Tyr 


1 7 1 

Val 






a ~i c 

O J J 










fi A n 

D U 










R A 
O *t J 








Cys 


Val 


Cys 


Asp 


Glu 


Gly 


Phe 


Thr 


Pro 


Thr 


Gin 


Asp 


Gin 


Hi s 


Gly 


Cys 




8 5 0 










85 5 










860 










Glu 


Glu 


Val 


Glu 


Gin 


Pro 


His 


His 


Lys 


Lys 


Glu 


Cys 


Tyr 


Leu 


Asn 


Phe 


Q i^cz 

O O J 










p "7 n 










875 










p R n 
o o u 


Asp 


Asp 


Thr 


Val 


Phe 


Cys 


Asp 


Ser 


Val 


Leu 


Ala 


Thr 


Asn 


Val 


Thr 


Gin 










885 










890 










895 




Gin 


Glu 


Cys 


Cys 


Cys 


Ser 


Leu 


Gly 


Ala 


Gly 


Trp Gly 


Asp 


His 


Cys 


Glu 








900 










905 










910 






He 


Tyr 


Pro 


Cys 


Pro 


Val 


Tyr 


Ser 


Ser 


Ala 


Glu 


Phe 


His 


Ser 


Leu 


Cys 






915 










920 










925 








Pro 


Asp 


Gly 


Lys 


Gly 


Tyr 


Thr 


Gin 


Asp 


Asn 


Asn 


He 


Val 


Asn 


Tyr 


Gly 




930 










935 










940 











52 



wo 01/00672 




PCT/USOO/18184 



T 1 o 

lie 


Pro 


Ai a 


His 


Arg 


Asp 


1 J- e 


Asp 


Glu 


Cys 


Met 


Leu 


Phe 


Gly 


OC L 




Q d. c, 
J? *i D 










Q n 

J? D u 










Q ^ R 

J? ,3 D 










960 


lie 


Cys 


Lys 


Glu 


Gly 
yob 


Lys 


Cys 


Val 


Asn 


Thr 
Q "7 n 


in 


Pro 


Gly 


Tyr 


UlU 

QH ^ 

y / D 


Cys 


Tyr 


Cys 


Lys 


Gin 
Q Q n 

you 


Gly 


Phe 


Tyr 


Tyr 


Asp 
985 


uiy 


Asn 


Leu 


Leu 


Glu 
990 


Cys 


V a 1 


Asp 


Val 


Asp 


Glu 


Cys 


Leu 


Asp 


Glu 


Ser 


Asn 


Cys 


Arg 


Asn 


Gly 


Val 


Cys 






995 










1000 








1005 






Glu 


Asn 


Thr 


Arg 


Gly 


Gly 


Tyr 


Arg 


Cys 


Ala 


Cys 


Thr 


Pro 


Pro 


Ala 


Glu 




1010 








1015 








1020 








Tyr 


Ser 


Pro 


Ala 


Gin 


Arg 


Gin 


Cys 


Leu 


Ser 


Pro 


Glu 


Glu 


Met 


Asp 


Val 


1025 








1030 








1035 








1040 


Asp 


Glu 


Cys 


Gin 


Asp 


Pro 


Ala 


Ala 


Cys 


Arg 


Pro 


Gly Arg 


Cys 


Val 


Asn 










1045 








1050 








1055 


Leu 


Pro 


Gly 


Ser 


Tyr 


Arg 


Cys 


Glu 


Cys 


Arg 


Pro 


Pro 


Trp 


Val 


Pro 


Gly 








1060 








1065 








1070 




Pro 


Ser 


Gly 


Arg 


Asp 


Cys 


Gin 


Leu 


Pro 


Glu 


Ser 


Pro 


Ala 


Glu 


Arg 


Ala 






1075 








1080 








1085 






Pro 


Glu 


Arg 


Arg 


Asp 


Val 


Cys 


Trp 


Ser 


Gin 


Arg 


Gly Glu 


Asp 


Gly Met 




1090 








1095 








1100 








Cys 


Ala 


Gly 


Pro 


Leu 


Ala 


Gly 


Pro 


Ala 


Leu 


Thr 


Phe 


Asp 


Asp 


Cys 


Cys 


1105 








1110 








1115 








1120 


Cys 


Arg 


Gin 


Gly Arg 


Gly 


Trp Gly 


Ala 


Gin 


Cys 


Arg 


Pro 


Cys 


Pro 


Pro 










1125 








1130 








1135 


Arg 


Gly Ala 


Gly 


Ser 


His 


Cys 


Pro 


Thr 


Ser 


Gin 


Ser 


Glu 


Ser 


Asn 


Ser 








1140 








1145 








1150 




Phe 


Trp 


Asp 


Thr 


Ser 


Pro 


Leu 


Leu 


Leu 


Gly 


Lys 


Pro 


Pro 


Arg 


Asp 


Glu 






1155 








1160 








1165 






Asp 


Ser 


Ser 


Glu 


Glu 


Asp 


Ser 


Asp 


Glu 


Cys 


Arg 


Cys 


Val 


Ser 


Gly 


Arg 




1170 








1175 








1180 








Cys 


Val 


Pro 


Arg 


Pro 


Gly 


Gly 


Ala 


Ala 


Cys 


Glu 


Cys 


Pro 


Gly 


Gly 


Phe 


1185 








1190 








1195 








1200 


Gin 


Leu 


Asp 


Ala 


Ser 


Arg 


Ala 


Arg 


Cys 


Val 


Asp 


He 


Asp 


Glu 


Cys 


Arg 










1205 








1210 








1215 


Glu 


Leu 


Asn 


Gin 


Arg 


Gly 


Leu 


Leu 


Cys 


Lys 


Ser 


Glu 


Arg 


Cys 


Val 


Asn 








1220 








1225 








1230 




Thr 


Ser 


Gly 


Ser 


Phe 


Arg 


Cys 


Val 


Cys 


Lys 


Ala 


Gly 


Phe 


Ala 


Arg 


Ser 






1235 








1240 








1245 






Arg 


Pro 


His 


Gly 


Ala 


Cys 


Val 


Pro 


Gin 


Arg 


Arg 


Arg 











1250 1255 1260 



<210> 62 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 62 

Met Ala Ser Leu Gly Leu Leu Leu Leu Leu Leu Leu Thr Ala Leu Pro 

15 10 15 

Pro Leu Trp Ser 
20 

<210> 63 

<211> 341 

<212> PRT 
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<400> 63 
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Ser Ser Leu Pro Giy Leu Asp Thr Ala Glu Ser Lys Ala Thr lie Ala 

15 10 15 

Asp Leu lie Leu Ser Ala Leu Glu Arg Ala Thr Val Phe Leu Glu Gin 

20 25 30 

Arg Leu Pro Glu lie Asn Leu Asp Gly Met Val Gly Val Arg Val Leu 

35 40 45 

Glu Glu Gin Leu Lys Ser Val Arg Glu Lys Trp Ala Gin Glu Pro Leu 

50 55 60 

Leu Gin Pro Leu Ser Leu Arg Val Gly Met Leu Gly Glu Lys Leu Glu 
65 70 75 80 

Ala Ala lie Gin Arg Ser Leu His Tyr Leu Lys Leu Ser Asp Pro Lys 

85 90 95 

Tyr Leu Arg Glu Phe Gin Leu Thr Leu Gin Pro Gly Phe Trp Lys Leu 

100 105 110 

Pro His Ala Trp lie His Thr Asp Ala Ser Leu Val Tyr Pro Thr Phe 

115 120 125 

Gly Pro Gin Asp Ser Phe Ser Glu Glu Arg Ser Asp Val Cys Leu Val 

130 135 140 

Gin Leu Leu Gly Thr Gly Thr Asp Ser Ser Glu Pro Cys Gly Leu Ser 
145 150 155 160 

Asp Leu C/s Arg Ser Leu Met Thr Lys Pro Gly Cys Ser Gly Tyr Cys 

165 170 175 

Leu Ser His Gin Leu Leu Phe Phe Leu Trp Ala Arg Met Arg Gly Cys 

180 185 190 

Thr Gin Gly Pro Leu Gin Gin Ser Gin Asp Tyr lie Asn Leu Phe Cys 

195 200 205 

Ala Asn r<et Met Asp Leu Asn Arg Arg Ala Glu Ala lie Gly Tyr Ala 

210 215 220 

Tyr Pro Thr Arg Asp lie Phe Met Glu Asn lie Met Phe Cys Gly Met 
225 230 235 240 

Gly Gly Phe Ser Asp Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu 

245 250 255 

Ser Trp Gin Lys Gin Gin Glu Gly Cys Phe Gly Glu Pro Asp Ala Glu 

260 265 270 

Asp Glu Glu Leu Ser Lys Ala lie Gin Tyr Gin Gin His Phe Ser Arg 

275 280 285 

Arg Val Lys Arg Arg Glu Lys Gin Phe Pro Asp Gly Cys Ser Ser His 

290 295 300 

Asn Thr Ala Thr Ala Val Ala Ala Leu Gly Gly Phe Leu Tyr lie Leu 
305 310 315 320 

Ala Glu Tyr Pro Pro Ala Asn Arg Glu Pro His Pro Ser Thr Pro Pro 

325 330 335 

Pro Pro Ser Ser Arg 
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<211> 19 
<212> PRT 

<213> Mus musculus 
<400> 64 

Met Ala Arg Leu Gly Leu Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
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Asp 


Phe 


Asp 


Ala 


Ala 


Ala 


Gly 


Val 


Phe 


Arg 






195 










200 










205 








Cys 


Arg 


Leu 


Pro 


Gly 


Ala 


Tyr 


Phe 


Phe 


Ser 


Phe 


Thr 


Leu 


Gly 


Lys 


Leu 




210 










215 










220 










Pro 


Arg 


Lys 


Thr 


Leu 


Ser 


Val 


Lys 


Leu 


Met 


Lys 


Asn 


Arg 


Asp 


Glu 


Val 


225 










230 










235 










240 


Gin 


Ala 


Met 


lie 


Tyr 


Asp 


Asp 


Gly Ala 


Ser 


Arg 


Arg 


Arg 


Glu 


Met 


Gin 










245 










250 










255 




Ser 


Gin 


Ser 


Val 


Met 


Leu 


Ala 


Leu 


Arg 


Arg 


Gly 


Asp 


Ala 


Val 


Trp 


Leu 








260 










265 










270 






Leu 


Ser 


His 


Asp 


His 


Asp 


Gly 


Tyr 


Gly 


Ala 


Tyr 


Ser 


Asn 


His 


Asp 


Leu 






275 










280 










285 








Pro 


Thr 


Asp 


Leu 


Lys 


Thr 


Val 


Leu 


Pro 


Ser 


Trp 


Asp 


Val 


His 


Cys 


Cys 




290 










295 










300 










Gin 


Val 


Asn 


Gin 


Arg 


Phe 


Glu 


Leu 


Cys 


He 


Gly 


Val 


He 


Pro 


Glu 


Glu 


305 










310 










315 










320 



Ser Gin His Trp Asp Asp Ala He Arg Met Asp Thr Asp Leu 



<210> 68 

<211> 17 

<212> PRT 

<213> Catarrhini 

<400> 68 



1 



5 



10 



325 



330 
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Met Leu Pro Leu Leu Leu Gly Leu Leu Gly Pro Ala Ala Cys Trp Ala 

15 10 15 

Leu 



<210> 69 
<211> 130 
<212> PRT 
<213> Catarrhini 



<400> 69 



Gly 


Pro 


Ala 


Pro 


Gly 


Pro 


Gly 


Ser 


Ser 


Glu 


Leu 


Arg 


Ser 


Ala 


Phe 


Ser 


1 








5 










10 










15 




Ala 


Ala 


Arg 


Thr 
20 


Thr 


Pro 


Leu 


Glu 


Gly 
25 


Ala 


Ser 


Glu 


Met 


Ala 
30 


Val 


Thr 


Phe 


Asp 


Lys 
35 


Val 


Tyr 


Val 


Asn 


He 
40 


Gly 


Gly 


Asp 


Phe 


Asp 
45 


Ala 


Ala 


Thr 


Gly 


Gin 


Phe 


Arg 


Cys 


Arg 


Val 


Pro 


Gly Ala 


Tyr 


Phe 


Phe 


Ser 


Phe 


Thr 




50 










55 










60 










Val 


Gly 


Lys 


Ala 


Pro 


His 


Lys 


Ser 


Leu 


Ser 


Val 


Met 


Leu 


Ala 


Leu 


Arg 


65 










70 










75 










80 


Arg 


Gly 


Asp 


Ala 


Val 
85 


Trp 


Leu 


Leu 


Ser 


His 
90 


Asp 


His 


Asp 


Gly 


Tyr 
95 


Gly 


Ala 


Tyr 


Ser 


Asn 
100 


His 


Gly 


Lys 


Tyr 


He 
105 


Thr 


Phe 


Ser 


Gly 


Phe 
110 


Leu 


Val 


Tyr 


Pro 


Asp 
115 


Leu 


Ala 


Gly 


Gly 


Ala 
120 


Pro 


Pro 


Gly 


Leu 


Gly 
125 


Ala 


Pro 


Glu 



Leu Leu 
130 



<210> 70 

<211> 126 

<212> PRT 

<213> Catarrhini 



<400> 70 



Ala 


Phe 


Ser 


Ala 


Ala 


Arg 


Thr 


Thr 


1 








5 








Ala 


Val 


Thr 


Phe 
20 


Asp 


Lys 


Val 


Tyr 


Val 


Ala 


Thr 
35 


Gly 


Gin 


Phe 


Arg 


Cys 
40 


Ser 


Phe 
50 


Thr 


Ala 


Gly 


Lys 


Ala 
55 


Pro 


Val 


Arg 


Asn 


Arg 


Asp 


Glu 


Val 


Gin 


65 










70 






Arg 


Pro 


Gly 


Ala 


Arg 
85 


Arg 


Ala 


Ala 


Asp 


Tyr 


Gly 


Asp 
100 


Thr 


Val 


Trp 


Leu 


Ala 


Leu 


Gly 


Ala 


Pro 


Gly Ala 


Thr 






115 










120 



Pro Leu Glu Gly Thr Ser Glu Met 

10 15 
Val Asn He Gly Gly Asp Phe Asp 

25 30 
Arg Val Pro Gly Ala Tyr Phe Phe 
45 

His Lys Ser Leu Ser Val Met Leu 

60 

Ala Leu Ala Phe Asp Glu Gin Arg 
75 80 
Ser Gin Ser Ala Met Leu Gin Leu 

90 95 
Arg Leu His Gly Ala Pro Gin Tyr 
105 110 
Phe Ser Gly Tyr Leu Val 
125 



<210> 71 
<211> 117 
<212> PRT 
<213> Catarrhini 
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<400> 71 

Ala Phe Ser Ala Ala Arg Thr Arg Ser Leu Val Gly Ser Asp Ala Gly 

15 10 15 

Pro Gly Pro Arg His Gin Pro Leu Ala Phe Asp Thr Glu Phe Val Asn 

20 25 30 

lie Gly Gly Asp Phe Asp Ala Ala Ala Gly Val Phe Arg Cys Arg Leu 

35 40 45 

Pro Gly Ala Tyr Phe Phe Ser Phe Thr Leu Gly Lys Leu Pro Arg Lys 

50 55 60 

Thr Leu Ser Val Lys Leu Met Lys Asn Arg Asp Glu Val Gin Ala Met 
65 70 75 80 

lie Tyr Asp Asp Gly Ala Ser Arg Arg Arg Glu Met Gin Ser Gin Ser 

85 90 95 

Val Met Leu Ala Leu Arg Arg Gly Asp Ala Val Trp Leu Leu Ser His 

100 105 110 

Asp His Asp Gly Tyr 
115 



<210> 72 

<211> 137 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetically generated peptide 

<400> 72 



Ala 


Phe 


Thr 


Val 


Leu 


Arg 


Ser 


Thr 


Asn 


Arg 


Pro 


Pro 


Ala 


Glu 


Met 


Ser 


1 








5 










10 










15 




Asn 


Pro 


Gly 


Gin 
20 


Pro 


Val 


He 


Phe 


Asp 
25 


Glu 


Val 


Leu 


Tyr 


Asn 
30 


Gin 


Gin 


Gly 


His 


Tyr 
35 


Asp 


Pro 


Ala 


Thr 


Gly 
40 


Lys 


Phe 


Thr 


Cys 


Lys 
45 


Val 


Pro 


Gly 


Leu 


Tyr 
50 


Tyr 


Phe 


Ser 


Phe 


His 
55 


Val 


Ser 


Ser 


Lys 


Gly 
60 


Thr 


Arg 


Gin 


Asn 


Val 


Cys 


Val 


Ser 


Leu 


Met 


Arg 


Ser 


Ser 


Arg 


Asn 


Gly Val 


Arg 


Gin 


Lys 


65 










70 










75 










80 


Val 


Met 


Glu 


Phe 


Cys 
85 


Asp 


Glu 


Tyr 


Ala 


Lys 
90 


Gly 


Thr 


Tyr 


Gin 


Val 
95 


Ala 


Ser 


Gly 


Gly 


Ala 

100 


Val 


Leu 


Gin 


Leu 


Arg 
105 


Gin 


Gly 


Asp 


Arg 


Val 
110 


Trp 


Leu 


Glu 


Leu 


Asp 
115 


Asp 


Lys 


Gin 


Thr 


Asn 
120 


Gly 


Leu 


Leu 


Gly 


Gly 
125 


Glu 


Gly 


Val 


His 


Ser 


Val 


Phe 


Ser 


Gly 


Phe 


Leu 


Leu 

















130 135 



<210> 73 

<211> 126 

<212> PRT 

<213> Catarrhini 



<400> 73 

Ala Phe Ser Ala Ala Arg Thr Thr 

1 5 
Ala Val Thr Phe Asp Lys Val Tyr 
20 

Ala Ala Thr Gly Gin Phe Arg Cys 
35 40 



Pro Leu Glu Gly Ala Ser Glu Met 

10 15 
Val Asn He Gly Gly Asp Phe Asp 

25 30 
Arg Val Pro Gly Ala Tyr Phe Phe 
45 
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Ser Phe Thr Val Gly Lys Ala Pro His 

50 55 
Val Arg Asn His Asp Glu Val Gin Ala 

65 70 
Arg Pro Ser Ala Arg Arg Ala Ala Ser 
85 

Asp Tyr Gly Asp Thr Val Trp Leu Arg 
100 105 
Ala Leu Gly Ala Pro Gly Ala Thr Phe 
115 120 




PCT/USOO/18184 



Lys Ser Leu Ser Val Met Leu 
60 

Leu Ala Phe Asp Glu Gin Arg 
75 80 
Gin Ser Ala Met Leu Gin Leu 

90 95 
Leu His Gly Ala Pro Gin Tyr 
110 

Ser Gly Tyr Leu Val 
125 



<210> 74 

<211> 134 

<212> PRT 

<213> Catarrhini 



<400> 74 

Ala Phe Ser Ala Ala Arg Thr Arg Ser Leu Val Gly Ser Asp Ala Gly 

15 10 15 

Ser Gly Pro Arg His Arg Pro Leu Ala Phe Asp Thr Glu Leu Val Asn 

20 25 30 

lie Gly Gly Asp Phe Asp Ala Ala Ala Gly Val Phe Arg Cys Arg Leu 

35 40 45 

Pro Gly Ala Tyr Phe Phe Ser Phe Thr Leu Gly Lys Leu Pro Arg Lys 

50 55 60 

Thr Leu Ser Val Lys Leu Met Lys Asn Arg Asp Glu Val Gin Ala Met 
65 70 75 80 

lie Tyr Asp Asp Gly Ala Ser Arg Arg Arg Glu Met Gin Ser Gin Ser 

85 90 95 

Val Met Leu Ala Leu Arg Arg Gly Asp Ala Val Trp Leu Leu Ser His 

100 105 110 

Asp His Asp Gly Tyr Gly Ala Tyr Ser Asn His Gly Lys Tyr lie Thr 

115 120 125 

Phe Ser Gly Phe Leu Val 
130 



<210> 75 
<211> 93 
<212> PRT 

<:213> Homo sapiens 



<400> 75 

Leu Cys Ser Gin Ser Gly Gin Thr 

1 5 
Arg Cys Ser Ser Ser Glu Gly Ala 
20 

Arg Leu Gly Thr Phe Pro Thr Pro 
35 40 
Glu Val Ser Gly Gin Leu lie Leu 

50 55 
Gly Thr Tyr Arg Cys Val Ala Thr 

65 70 
Glu Leu Thr Leu Ser Val Thr Glu 
85 



Ser Val Gly Gly Ser Thr Ala Leu 

10 15 
Pro Lys Pro Val Tyr Asn Trp Val 

25 30 
Ser Pro Gly Ser Met Val Gin Asp 
45 

Thr Asn Leu Ser Leu Thr Ser Ser 
60 

Asn Gin Met Gly Ser Ala Ser Cys 
75 80 
Pro Ser Gin Gly Arg 
90 



<210> 76 
<211> 23 
<212> PRT 
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<213> Homo sapiens 



<400> 76 

Val Ala Gly Ala Leu He Gly Val Leu Leu Gly Val Leu Leu Leu Ser 

15 10 15 

Val Ala Ala Phe Cys Leu Val 
20 



<210> 77 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 77 

Arg Phe Gin Lys Glu Arg Gly Lys Lys Pro Lys Glu Thr Tyr Gly Gly 

15 10 15 

Ser Asp Leu Arg 
20 



<210> 78 

<211> 125 

<212> PRT 

<213> Homo sapiens 

<400> 78 



Trp 


Ala 


Leu 


Gly 


Pro 


Thr 


Pro 


Gly 


Pro Gly 


Ser 


Ser Glu 


Leu 


Arg 


Ser 


1 








5 










10 








15 




Ala 


Phe 


Ser 


Ala 

20 


Ala 


Arg 


Thr 


Thr 


Pro 

25 


Leu 


Glu 


Gly Thr 


Ser 
30 


Glu 


Met 


Ala 


Val 


Thr 
35 


Phe 


Asp 


Lys 


Val 


Tyr 
40 


Val 


Asn 


He 


Gly Gly 
45 


Asp 


Phe 


Asp 


Val 


Ala 

50 


Thr 


Gly 


Gin 


Phe 


Arg 
55 


Cys 


Arg 


Val 


Pro 


Gly Ala 
60 


Tyr 


Phe 


Phe 


Ser 


Phe 


Thr 


Ala 


Gly 


Lys 


Ala 


Pro 


His 


Lys 


Ser 


Leu Ser 


Val 


Met 


Leu 


65 










70 










75 








80 


Val 


Arg 


Asn 


Arg 


Asp 
85 


Glu 


Val 


Gin 


Ala 


Leu 
90 


Ala 


Phe Asp 


Glu 


Gin 
95 


Arg 


Arg 


Pro 


Gly 


Ala 
100 


Arg 


Arg 


Ala 


Ala 


Ser 
105 


Gin 


Ser 


Ala Met 


Leu 
110 


Gin 


Leu 


Asp 


Tyr 


Gly 
115 


Asp 


Thr 


Val 


Trp 


Leu 
120 


Arg 


Leu 


His 


Gly Ala 
125 









<210> 79 
<211> 17 
<212> PRT 

<213> Homo sapiens 



<400> 79 

Pro Gin Tyr Ala Leu Gly Ala Pro Gly Ala Thr Phe Ser Gly Tyr Leu 

15 10 15 

Val 



<210> 80 

<211> 192 

<212> PRT 

<213> Homo sapiens 
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<400> 80 





Ala 


Asp 


Ala 


Asp 


Ala 


Asp 


Ala 


Pro 


Ala 


Arg 


Gl V 


Pro 


Pro 


Ala 


Pro 


1 








5 










1 0 










1 5 




Pro 


Glu 


Pro 


Aro 


Ser 


Ala 


Phe 


Ser 


Ala 


Ala 


Arg 


Thr 


Ara 


Ser 


Le u 


Val 








2 0 










2 5 










3 0 






Gly 


Se r 


Asp 


Ala 


Gl V 


Pro 


Gly 


Pro 


Arg 


Hi s 


Gl n 


Pro 


Leu 


Ala 


Phe 


Asp 






J D 










40 


















Thr 


Glu 


Phe 


Val 


As n 


lie 


Gly 


Gly 


Asp 


Phe 


Asp 


Ala 


Ala 


Ala 


Gly 


Val 




3 U 










55 










n 
D u 










Phe 


Arg 


Cys 


Arg 


Le u 


Pro 


Gly Ala 


Tvr 


Phe 


Phe 


Se r 


Phe 


Thr 


Leu 


Gly 


O o 










7 0 










7 5 










o u 




Leu 


Pro 


Arg 


Ly s 


Thr 


Leu 


Ser 


Val 


Lys 


Leu 


Me t 


jys 


Asn 


Arg 


Asp 










8 5 










9 0 










Q C 

y D 




Glu 


Va 1 


Gin 


Ala 


Me t 


lie 


Tyr 


Asp 


Asp 


Gly 


Ala 


Ser 


Arg 


Arg 


Arg 


Glu 








10 0 










X U D 










Tin 

X J. U 






Met 


Gin 


Ser 


Gin 


Ser 


Val 


Met 


Leu 


Ala 


Leu 


Arg 


Arg 


Gly 


Asp 


Ala 


Va 1 






115 










120 










125 








Trp 


Leu 


Leu 


Ser 


His 


Asp 


His 


Asp 


Gly 


Tyr 


Gly 


Ala 


Tyr 


Ser 


Asn 


His 




130 










135 










140 










Asp 


Leu 


Pro 


Thr 


Asp 


Leu 


Lys 


Thr 


Val 


Leu 


Pro 


Ser 


Trp 


Asp 


Val 


His 


145 










150 










155 










160 


Cys 


Cys 


Gin 


Val 


Asn 


Gin 


Arg 


Phe 


Glu 


Leu 


Cys 


He 


Gly 


Val 


He 


Pro 










165 










170 










175 




Glu 


Glu 


Ser 


Gin 


His 


Trp 


Asp 


Asp 


Ala 


He 


Arg 


Met 


Asp 


Thr 


Asp 


Leu 



180 185 190 



<210> 81 

<211> 325 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 

<222> 20 

<223> Xaa = Unknown amino acid 



<400> 81 

Met Ala Asp Leu Pro Gly Pro Phe Leu Cys Gly Ala Leu Leu Gly Phe 

15 10 15 

Leu Cys Leu Xaa Leu Ala Val Glu Val Lys Val Pro Thr Glu Pro Leu 

20 25 30 

Ser Thr Pro Leu Gly Lys Thr Ala Glu Leu Thr Cys Thr Tyr Ser Thr 

35 40 45 

Ser Val Gly Asp Thr Phe Ala Leu Glu Trp Ser Phe Val Gin Pro Gly 

50 55 60 

Lys Pro He Ser Glu Ser His Pro He Leu Tyr Phe Thr Asn Gly His 
65 70 75 80 

Leu Tyr Pro Thr Gly Ser Lys Ser Lys Arg Val Ser Leu Leu Gin Asn 

85 90 95 

Pro Pro Thr Val Gly Val Ala Thr Leu Lys Leu Thr Asp Val His Pro 

100 105 110 

Ser Asp Thr Gly Thr Tyr Leu Cys Gin Val Asn Asn Pro Pro Asp Phe 

115 120 125 

Tyr Thr Asn Gly Leu Gly Leu He Asn Leu Thr Val Leu Val Pro Pro 

130 135 140 

Ser Asn Pro Leu Cys Ser Gin Ser Gly Gin Thr Ser Val Gly Gly Ser 
145 150 155 160 
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Thr 

Asn 

Val 

Thr 

Ala 
225 
Val 

Val 

Pro 

Pro 

Phe 
305 
Leu 



Ala Leu 

Trp Val 

Gin Asp 
195 
Ser Ser 
210 

Ser Cys 

Thr Gly 

Ala Ala 

Lys Glu 
275 
Gly lie 
290 

Leu Glu 
Pro Met 



Arg Cys 
165 
Arg Leu 
180 

Glu Val 

Gly Thr 

Glu Leu 

Ala Leu 
245 
Phe Cys 
260 

Thr Tyr 

Ser Glu 

Arg Pro 

Val Val 
325 



'Ser Ser Ser Glu 
Gly Thr 
Ser Gly 



Tyr Arg 
215 
Thr Leu 
230 

He Gly 

Leu Val 

Gly Gly 

His Thr 
295 
Ser Ala 
310 



Phe Pro 
185 
Gin Leu 
200 

Cys Val 



Ser Val 

Val Leu 

Arg Phe 
265 
Ser Asp 
280 

Cys Met 
Ser Thr 



Gly Ala 
170 

Thr Pro 

lie Leu 

Ala Thr 

Thr Glu 
235 
Leu Gly 
250 

Gin Lys 

Leu Arg 

Arg Ala 

Val Thr 
315 



Pro 

Ser 

Thr 

Asn 
220 
Pro 

Val 

Glu 

Glu 

Asp 
300 
Thr 
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Lys Pro Val Tyr 
175 

Pro Gly Ser Met 
190 

Asn Leu Ser Leu 
205 

Gin Leu Gly Ser 

Ser Gin Gly Arg 
240 

Leu Leu Leu Ser 
255 

Arg Gly Lys Lys 
270 

Asp Ala He Ala 
285 

Ser Ser Lys Gly 

Thr Lys Ser Lys 
320 



<210> 82 

<211> 4317 

<212> DNA 

<213> Mus musculus 



<400> 82 

cctcctgctg 

actcgatccc 

cgggccccga 

ctgctggcac 

ggcagcgggg 

cctgtgatct 

tccaacatga 

ttccgcgtgg 

cagtgcctgt 

ggagctggca 

ctgccgcccc 

caggtgatcg 

ttggtgcccc 

aacgtgcgtg 

aacgctgaag 

aggccaccca 

tgtggcagca 

actgcctggg 

aagcctgtac 

aacagcaccc 

gactgcctca 

cccctcgcag 

gtgagcaccg 

tgctgtagtg 

gcagccttca 

gacgctcacc 

aaaccccagc 

gagagaggag 

cccactacca 



tcccctccct 
c tgctcgggc 
gcggtgccca 
tactcctgct 
cacaggcggg 
gcaagcggac 
cgctcatcgg 
tggtgtgccc 
gtcccccgga 
ccgggagttc 
ttgccccaga 
cagatcctcc 
tggggccagg 
tccatcaccc 
gcccagcctc 
ctcaaaagcc 
accctttgcc 
gacaaagcaa 
ctgtacgtgg 
actgccagga 
acaaccctgg 
cacagtgcat 
aacaccagtg 
tgggtaaagc 
39g^g3tctg 
atccaggggg 
agcttcctga 
tgaccatgga 
ccacctcacc 



acccttggct 
tcgacctcca 
cggcctggcc 
ggcgctgctg 
ggcggggcgc 
ctgtc tgaag 
agagaacggc 
tctaccctgc 
tttcacgggg 
aggccccggc 
aggagagtct 
cgggccgggg 
acaaatctcg 
tcctgaagct 
ttcccagcac 
actgggccgc 
tggccttacc 
gtgtcacaag 
ggaggtgggt 
tatcaacgaa 
ctcttatcgc 
tgccgacaaa 
ccagcaccct 
ctggggtgcc 
ccccggctgg 
aaagcgactt 
aagccccagc 
tccaccagtg 
cccccggcct 



tctcgccccg 
atctccgagg 
cctgcgatgc 
ggccccggcg 
tgggcccaac 
ggccagtgtc 
cacagcaccg 
atgaacggtg 
cgcttctgcc 
tggcccgacc 
gtggctagca 
gagggtcctc 
gcagaagtgc 
tccgttcagg 
ttgctgccgc 
tgcttccagg 
aagcaggaag 
tgcccacagc 
gctgactgcc 
tgtgcgatgc 
tgtgtctgcc 
ccagaggaga 
ctgaccacac 
cggtgccagc 
gaaagggtac 
ctccctcttc 
cgagcaccac 
agtgaggagc 
tacccagagc 



ctctgccctc 
gtcgtgcggc 
gccaggccgg 
gccgaggggt 
gcttcaaggt 
gggacagctg 
acacgctcac 
gccagtgctc 
aggtgcctgc 
gggccatgtc 
aacacgccat 
ctgcacaaca 
aggctccgcc 
tgcaccgcat 
atcccaagcc 
acacattgcc 
attgctgcgg 
ttcagtatac 
cccagggcta 
ccgggaatgt 
cgcccggtca 
agagcctgtg 
gcctaacccg 
gctgcccggc 
catatcctca 
ctgcacccga 
ccctcgagga 
gatcggtgca 
tcatctctcg 



tgctaccaac 
cccggatgcc 
cggattgggg 
gggccggccg 
ggtctttgcg 
tcagcagggc 
cggttctgcc 
ttcccgaaac 
tgcaggaacc 
cacaggcccg 
ttacgcggtg 
tgcagccttc 
ccccgtggtg 
cgaggggccg 
cccgcacccg 
caagcagcct 
tagcatcggt 
aggggtgcag 
caagaggctc 
gtgccatggt 
tagcttgggt 
tttccgcctt 
ccagctctgc 
agatggtaca 
cctcccacca 
cgggccaccc 
cacagaggaa 
gcagagccac 
cccctcccca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



62 



wo 01/00672 



cctaccttcc accggttc^t gccagacttg cccccatccc 
cccactcagg tcacagagac cgatgagtgc cgattgaacc 
cagtgtgtgc ctggcccctc ggattactcc tgccactgca 
ccgcagcacc gctactgtgt tgatgtgaac gagtgcgagg 
aaaggcatct gtatgaacac tggtggctcc tacaattgtc 
ctccacgtgg gtgcaggggg ccgctcgtgc gtggacctga 
ctgtgtgggg acggtggctt ctgcatcaac ttccctggtc 
cctggctacc ggctcaaggc ctcccgaccg cccatttgcg 
gaccctagca cctgccctga tggcaaatgt gaaaacaaac 
gcctgccagc ctggctaccg tagccagggg ggcggggcct 
tccgagggta ccccctgctc tcctggatgg tgtgagaacc 
acgtgtgccc aggggatacg aacccgcaca ggacgcctca 
tgtgaggctg ggaaagtgtg ccaagatggc atctgcacga 
tgtcagtgcc tctccggcta tcatctgtca agggatcgga 
gaatgtgact tccctgcggc ctgcatcggg ggtgactgca 
agatgtctct gtcccctggg tcatcggttg gtgggcggca 
gatgagtgca gccaggaccc aggcctgtgc ctgccccatg 
tcctatgtct gtgtctgtga tgagggtttc acactcaccc 
gaggtggagc agccccacca caagaaggag tgctacctta 
tgtgacagcg tattggctac caatgtcact cagcaggaat 
ggctggggag accactgcga aatctatccc tgtccagtct 
agcctggtgc ccgatgggaa aaggctacac tcaggacaac 
cctgcccacc gtgacatcga cgaatgcata ttgtttgggg 
aagtgtgtga acacgcagcc cggctacgag tgctactgca 
ggcaacctgc tggagtgcgt ggacgtggat gagtgcttgg 
ggagtgtgtg agaacacacg tggcggctac cgctgtgcct 
agtcccgcac aggcccagtg tctgatcccg gagagatgga 
aagtgtgctg gggccagcga ggagaggacg gcatgtgtat 
gccctcactt ttgatgactg ctgctgccgc cagccgcggc 
tgcccgccac gtggcaccgg gtcccagtgc ccgacttcac 
tgggacacaa gccccctgct actggggaag tctccgcgag 
gattcagatg agtgccgttg tgtgagcgga cgctgtgtgc 
tgcgagtgtc ctggaggctt tcagctggac gcctcccgtg 
gagtgccgag aactgaacca gcggggactg ctgtgtaaga 
agtggatcct tccgctgtgt ctgcaaagct ggcttcacgc 
gcgtgcctca gcgccgccgc tgatgatgca gccatagccc 
cgagggtatt ttcactgaaa gtggagacag acaagtacat 
agagcatgga cccaaggatc cttcagggcc cacaaatctc 
aggtgctcct gtctgcagag tgctgtctgc tttctcccaa 
gacatcagat ctgccccttt aatttactct tggctttcaa 
caaagcgggc agcatcaact gcttggcggg ttggactgag 
tagaatttat tgtggctctg attatgtaca ctagatgtgc 
acatggtttg tacaataaat acatccgccg ggaaaaaaaa 



• 
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gaagtgcagt ggagatcgcc 1800 

agaatatctg tggccatgga 1860 

acgctggcta ccggtcacac 1920 

cagagccctg cggccccggg 1980 

actgcaaccg aggctaccgc 2040 

acgagtgcgc caagcctcac 2100 

actacaaatg caactgctat 2160 

aagacatcga cgcgtgtcgc 2220 

ctggcagctt caagtgcatc 2280 

gtcgtgatgt caacgaatgc 2340 

ttccgggttc ttaccgttgc 2400 

gttgcataga cgtggatgag 2460 

acacaccagg ctctttccag 2520 

gccgctgtga ggacattgat 2^80 

tcaataccaa tggttcctac 2640 

ggaagtgcaa gaaagatata 2700 

cctgcgagaa cctccagggc 2760 

aggaccagca tgggtgtgag 2820 

acttcgatga cacagtgttc 2880 

gctgttgctc tctgggagct 2940 

acagctcagc cgaatttcac 3000 

aacattgtga actatgcatt 3060 

cagagatctg caaggagggc 3120 

agcagggctt ctactacgat 3180 

atgagtctaa ctgcaggaac 3240 

gcactccgcc ggcagagtac 3300 

gcacgcccca gagagacgtg 3360 

999gcc:cctg ggcgggacct 3420 

tgggtaccca gtgcagaccg 3480 

agagtgagag caattctttc 3540 

acgaagacag ctcagaggag 3600 

cacggccagg cggggcggta 3660 

cccgctgcgt ggacattgat 3720 

gcgagcggtg cgtgaacacc 3780 

gcagccgccc tcacgggcct 3840 

acacctcagt gatcgatcat 3900 

cctttgctcc tgaccaaacg 3960 

cttcccacac cccaaaccca 4020 

gggtgattcc tagaaacttc 4080 

ggcaaattga tattcacatc 4140 

ctgggaccca ggatgtgaaa 4200 

ctgacctgct gaccaggctc 4260 

aaaaaaaaaa aaaaaaa 4317 



<210> 83 
<211> 1251 
<212> PRT 

<213> Mus musculus 



<400> 83 

Met Arg Gin Ala Ala Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu 

15 10 15 

Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin 

20 25 30 

Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro 

35 40 45 

Val lie Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys 
50 55 60 
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Gin Gin Gly Ser AsfT^Met Thr Leu lie Gly Glu Asn Gly His Ser Thr 
65 70 75 80 

Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro 

85 90 95 

Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro 

100 105 110 

Pro Asp Phe Thr Gly Arg Phe Cys Gin Val Pro Ala Ala Gly Thr Gly 

115 120 125 

Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser 

130 135 140 

Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser 
145 150 155 160 

Lys His Ala lie Tyr Ala Val Gin Val lie Ala Asp Pro Pro Gly Pro 

165 170 175 

Gly Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly 

180 185 190 

Pro Gly Gin lie Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn 

195 200 205 

Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg He 

210 215 220 

Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro 
225 230 235 240 

His Pro Lys Pro Gin His Pro Arg Pro Pro Thr Gin Lys Pro Leu Glu 

245 250 255 

Arg Cys Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro 

260 265 270 

Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr 

275 280 285 

Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr 

290 295 300 

Gly Val Gin Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys 
305 310 315 320 

Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp He Asn 

325 330 335 

Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn Asn 

340 345 350 

Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro 

355 360 365 

Leu Ala Ala Gin Cys He Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys 

370 375 380 

Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr Thr 
385 390 395 400 

Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly 

405 410 415 

Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu 

420 425 430 

He Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp 

435 440 445 

Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro Asp 

450 455 460 

Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro 
465 470 475 480 

Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro 

485 490 495 

Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr 

500 505 510 

Ser Pro Pro Arg Pro Tyr Pro Glu Leu He Ser Arg Pro Ser Pro Pro 



515 



520 



525 
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i 










Thr 


Phe 


His 


Arg 


Phi 




Pro 


Asp 


Leu 




530 










535 






Glu 


He 


Ala 


Pro 


Thr 


Gin 


Val 


Thr 


Glu 


545 










550 








Gin 


Asn 


He 


Cys 


Gly 


His 


Gly 


Gin 


Cys 










565 










Ser 


Cys 


His 


Cys 


Asn 


Ala 


Gly 


Tyr 


Arg 








580 










585 


Cys 


Val 


Asp 


Val 


Asn 


Glu 


Cys 


Glu 


Ala 






595 










600 




Gly 


He 


Cys 


Met 


Asn 


Thr 


Gly 


Gly 


Ser 




610 










615 






Gly 


Tyr 


Arg 


Leu 


His 


Val 


Gly 


Ala 


Gly 


625 










630 








Asn 


Glu 


Cys 


Ala 


Lys 


Pro 


His 


Leu 


Cys 










645 










Asn 


Phe 


Pro 


Gly 


His 


Tyr 


Lys 


Cys 


Asn 








660 










665 


Lys 


Ala 


Ser 


Arg 


Pro 


Pro 


He 


Cys 


Glu 






675 










680 




Pro 


Ser 


Thr 


Cys 


Pro 


Asp 


Gly 


Lys 


Cys 




690 










695 






Lys 


Cys 


He 


Ala 


Cys 


Gin 


Pro 


Gly 


Tyr 


705 










710 








Cys 


Arg 


Asp 


Val 


Asn 


Glu 


Cys 


Ser 


Glu 










725 










Trp 


Cys 


Glu 


Lys 


Leu 


Pro 


Gly 


Ser 


Tyr 








740 










745 


He 


Arg 


Thr 


Arg 


Thr 


Gly 


Arg 


Leu 


Ser 






755 










760 




Glu 


Ala 


Gly 


Lys 


Val 


Cys 


Gin 


Asp 


Gly 




770 










775 






Ser 


Phe 


Gin 


Cys 


Gin 


Cys 


Leu 


Ser 


Gly 


785 










790 








Ser 


Arg 


Cys 


Glu 


Asp 


He 


Asp 


Glu 


Cys 










805 










Gly Gly 


Asp 


Cys 


He 


Asn 


Thr 


Asn 


Gly 








820 










825 


Leu 


Gly 


His 


Arg 


Leu 


Val 


Gly 


Gly 


Arg 






835 










840 




Glu 


Cys 


Ser 


Gin 


Asp 


Pro 


Gly 


Leu 


Cys 




850 










855 






Leu 


Gin 


Gly 


Ser 


Tyr 


Val 


Cys 


Val 


Cys 


865 










870 








Gin 


Asp 


Gin 


His 


Gly 


Cys 


Glu 


Glu 


Val 










885 










Glu 


Cys 


Tyr 


Leu 


Asn 


Phe 


Asp 


Asp 


Thr 








900 










905 


Ala 


Thr 


Asn 


Val 


Thr 


Gin 


Gin 


Glu 


Cys 






915 










920 




Trp 


Gly 


Asp 


His 


Cys 


Glu 


He 


Tyr 


Pro 




930 










935 






Glu 


Phe 


His 


Ser 


Leu 


Val 


Pro 


Asp 


Gly 


945 










950 








Gin 


His 


Cys 


Glu 


Leu 


Cys 


He 


Pro 


Ala 










965 










He 


Leu 


Phe 


Gly 


Ala 


Glu 


He 


Cys 


Lys 



980 985 
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Pro Pro Ser Arg S^^^la Val 
540 



Thr 


Asp 


Glu 


Cys 


Arg 


Leu 


Asn 




555 










560 


Val 


Pro Gly 


Pro 


Ser 


Asp 


Tyr 


570 










575 




Ser 


His 


Pro 


Gin 


His 


Arg 


Tyr 










590 






Glu 


Pro 


Cys 


Gly 


Pro 


Gly 


Lys 








605 








Tyr 


Asn 


Cys 


His 


Cys 


Asn 


Arg 






620 










Gly 


Arg 


Ser 


Cys 


Val 


Asp 


Leu 




635 










640 


Gly 


Asp 


Gly 


Gly 


Phe 


Cys 


He 


650 










655 




Cys 


Tyr 


Pro 


Gly 


Tyr 


Arg 


Leu 










670 






Asp 


He 


Asp 


Glu 


Cys 


Arg 


Asp 








685 








Glu 


Asn 


Lys 


Pro 


Gly 


Ser 


Phe 






700 










Arg 


Ser 


Gin 


Gly 


Gly 


Gly 


Ala 




715 










720 


Gly 


Thr 


Pro 


Cys 


Ser 


Pro 


Gly 


730 










735 




Arg 


Cys 


Thr 


Cys 


Ala 


Gin 


Gly 










750 






Cys 


He 


Asp 


Val 


Asp 


Asp 


Cys 








765 








He 


Cys 


Thr 


Asn 


Thr 


Pro 


Gly 






780 










Tyr 


His 


Leu 


Ser 


Arg 


Asp 


Arg 




795 










800 


Asp 


Phe 


Pro 


Ala 


Ala 


Cys 


He 


810 










815 




Ser 


Tyr 


Arg 


Cys 


Leu 


Cys 


Pro 










830 






Lys 


Cys 


Lys 


Lys 


Asp 


He 


Asp 








845 








Leu 


Pro 


His 


Ala 


Cys 


Glu 


Asn 






860 










Asp 


Glu 


Gly 


Phe 


Thr 


Leu 


Thr 




875 










880 


Glu 


Gin 


Pro 


His 


His 


Lys 


Lys 


890 










895 




Val 


Phe 


Cys 


Asp 


Ser 


Val 


Leu 










910 






Cys 


Cys 


Ser 


Leu 


Gly 


Ala 


Gly 








925 








Cys 


Pro 


Val 


Tyr 


Ser 


Ser 


Ala 






940 










Lys 


Arg 


Leu 


His 


Ser 


Gly 


Gin 




955 










960 


His 


Arg 


Asp 


He 


Asp 


Glu 


Cys 


970 










975 




Glu 


Gly 


Lys 


Cys 


Val 


Asn 


Ser 



990 



65 



wo 01/00672 PCT/IIS00/181«4 

Gin Pro Gly Tyr GlwPTys Tyr Cys Lys Gin Gly Phe Tyr T^^^T^sp Gly 

995 1000 1005 

Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 

1010 1015 1020 

Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
1025 1030 1035 1040 

Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu Ser Pro 

1045 1050 1055 

Glu Glu Met Glu His Ala Pro Glu Arg Arg Glu Val Cys Trp Gly Gin 

1060 1065 1070 

Arg Gly Glu Asp Gly Met Cys Met Gly Pro Leu Ala Gly Pro Ala Leu 

1075 1080 1085 

Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Tyr Gin Cys 

1090 1095 1100 

Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
1105 1110 1115 1120 

Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 

1125 1130 1135 

Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 

1140 1145 1150 

Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 

1155 1160 1165 

Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 

1170 1175 1180 

lie Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 
1185 1190 1195 1200 

Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 

1205 1210 1215 

Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 

1220 1225 1230 

Ala Asp Asp Ala Ala lie Ala His Thr Ser Val He Asp His Arg Gly 

1235 1240 1245 

Tyr Phe His 
1250 

<210> 84 
<211> 353 
<212> PRT 

<213> Arabidopsis thaliama 
<400> 84 

Met Ala Asn Asp Gin Lys Asn Ser Glu Ser Phe Pro Ala Lys Glu Asp 

15 10 15 

His Lys Lys Asp Asp Ala Ala Ala Pro Ala Glu Val Asp His Lys Asp 

20 25 30 

Glu Phe Ser Ala Ser Gin Pro His Pro Val Glu Asn He Val Leu Val 

35 40 45 

Gly Arg Thr Gly Asn Gly Lys Ser Ala Thr Gly Asn Ser lie Val Arg 

50 55 60 

Ser Lys Val Phe Lys Ser Lys Thr Lys Ser Ser Gly Val Thr Met Glu 
65 70 75 80 

Cys His Ala Val Lys Ala Val Thr Pro Glu Gly Pro He Leu Asn Val 

85 90 95 

He Asp Thr Pro Gly Leu Phe Asp Leu Ser Val Ser Ala Glu Phe He 

100 105 110 

Gly Lys Glu He Val Lys Cys Leu Thr Leu Ala Asp Gly Gly Leu His 
115 120 125 
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# II 

Ala Val Leu Leu VawEeu Ser Val Arg Thr Arg He Ser GnT'Glu Glu 

130 135 140 

Glu Met Val Leu Ser Thr Leu Gin Val Leu Phe Gly Ser Lys Tie Val 
145 150 155 160 

Asp Tyr Leu He Val Val Phe Thr Gly Gly Asp Val Leu Glu Asp Asp 

165 170 175 

Gly Met Thr Leu Glu Asp Tyr Leu Gly Asp Asn Met Pro Asp Phe Leu 

180 185 190 

Lys Arg Val Leu He Leu Cys Gly Gin Arg Met He Leu Phe Asp Asn 

195 200 205 

Lys Thr Lys Asp Asp Glu Lys Lys Thr Lys Gin Val His Glu Leu Leu 

210 215 220 

Lys Leu He Asp Leu Val Arg Lys Gin Asn Asn Asn He Pro Tyr Thr 
225 230 235 240 

Asp Glu Met Tyr His Met He Lys Glu Glu Asn Glu Arg His Lys Lys 

245 250 255 

Glu Gin Glu Glu Leu Glu Ser Lys Gly His Ser Glu Glu Gin Leu Ala 

260 265 270 

Ala Leu Met Lys Glu Leu Gin He Met Asn Glu Arg Asn Leu Lys Ala 

275 280 285 

Met Ala Glu Met Met Glu Lys Asn Met Lys He Ala Met Glu Ala Gin 

290 295 300 

Glu Lys Leu Phe Glu Gin Arg Glu Lys Ala Gin Glu Met Ser Tyr Gin 
305 310 315 320 

Gin Lys Met Glu Met Gin Glu Lys Leu Lys Gin Met Glu Gly Arg Met 

325 330 335 

Arg Ala Glu Met Glu Ala Gin Met Leu Ser Arg Gin Gin Cys Ser He 
340 345 350 

Leu 



<210> 85 
<211> 1251 
<212> PRT 

<213> Homo sapiens 



<400> 85 



Met 


Arg 


Gin 


Ala 


Ala 


Leu 


Gly 


Leu 


1 








5 








Leu 


Gly 


Pro 


Gly 


Gly 


Arg Gly 


Val 








20 










Ala 


Gly 


Ala 

35 


Gly 


Arg 


Trp 


Ala 


Gin 
40 


Val 


He 

50 


Cys 


Lys 


Arg 


Thr 


Cys 
55 


Leu 


Gin 


Gin 


Gly 


Ser 


Asn 


Met 


Thr 


Leu 


65 










70 






Asp 


Thr 


Leu 


Thr 


Gly 
85 


Ser 


Ala 


Phe 


Cys 


Met 


Asn 


Gly 
100 


Gly 


Gin 


Cys 


Ser 


Pro 


Asp 


Phe 
115 


Thr 


Gly 


Arg 


Phe 


Cys 
120 


Ala 


Gly 
130 


Thr 


Gly 


Ser 


Ser 


Gly 
135 


Pro 


Thr 


Gly 


Pro 


Leu 


Pro 


Pro 


Leu 


Ala 


145 










150 







Leu 


Ala 
10 


Leu 


Leu 


Leu 


Leu 


Ala 
15 


Leu 


Gly 


Arg 


Pro Gly 


Ser 


Gly 


Ala 


Gin 


25 










30 






Arg 


Phe 


Lys 


Val 


Val 
45 


Phe 


Ala 


Pro 


Lys 


Gly 


Gin 


Cys 
60 


Arg 


Asp 


Ser 


Cys 


He 


Gly 


Glu 
75 


Asn 


Gly 


His 


Ser 


Thr 
80 


Arg 


Val 
90 


Val 


Val 


Cys 


Pro 


Leu 
95 


Pro 


Ser 


Arg 


Asn 


Gin 


Cys 


Leu 


Cys 


Pro 


105 










110 






Gin 


Val 


Pro 


Ala 


Ala 
125 


Gly 


Thr 


Gly 


Gly 


Trp 


Pro 


Asp 
140 


Arg 


Ala 


Met 


Ser 


Pro 


Glu 


Gly 


Glu 


Ser 


Val 


Ala 


Ser 



155 160 
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Lys 


His 


Ala 


He 


T^ 

165 


pia 


Val 


Gin 


Val 


He 
170 


Ala 


Asp 


Pro 


p" 


^ly 

175 


Pro 


Gly 


Glu 


Gly 


Pro 
180 


Pro 


Ala 


Gin 


His 


Ala 
185 


Ala 


Phe 


Leu 


Val 


Pro 
190 


Leu 


Gly 


Pro 


Gly 


Gin 
195 


He 


Ser 


Ala 


Glu 


Val 
200 


Gin 


Ala 


Pro 


Pro 


Pro 
205 


Val 


Val 


Asn 


val 


Arg 
2 10 


Val 


His 


His 


Pro 


Pro 
215 


Glu 


Ala 


Ser 


Val 


Gin 
220 


Val 


His 


Arg 


He 


Glu 


Gly 


Pro 


Asn 


Ala 


Glu 


Gly 


Pro 


Ala 


Ser 


Ser 


Gin 


His 


Leu 


Leu 


Pro 


225 










230 










235 










240 


His 


Pro 


Lys 


Pro 


Gin 
24 5 


His 


Pro 


Arg 


Pro 


Pro 
250 


Thr 


Gin 


Lys 


Pro 


Leu 
255 


Gly 


Arg 


Cys 


Phe 


Gin 
260 


Asp 


Thr 


Leu 


Pro 


Lys 
265 


Gin 


Pro 


Cys 


Gly 


Ser 
270 


Asn 


Pro 


Leu 


Pro 


Gly 
275 


Leu 


Thr 


Lys 


Gin 


Glu 
280 


Asp 


Cys 


Cys 


Gly 


Ser 
285 


He 


Gly 


Thr 


Ala 


Trp 
290 


Gly 


Gin 


Ser 


Lys 


Cys 
295 


His 


Lys 


Cys 


Pro 


Gin 
300 


Leu 


Gin 


Tyr 


Thr 


Gly Val 


Gin 


Lys 


Pro 


Val 


Pro 


Val 


Arg 


Gly 


Glu 


Val 


Gly 


Ala 


Asp 


Cys 


305 










310 










315 










320 


Pro 


Gin 


Gly 


Tyr 


Lys 
325 


Arg 


Leu 


Asn 


Ser 


Thr 
330 


His 


Cys 


Gin 


Asp 


He 
335 


Asn 


Glu 


Cys 


Ala 


Met 
34 0 


Pro 


Gly 


Asn 


Val 


Cys 
345 


His 


Gly 


Asp 


Cys 


Leu 
350 


Asn 


Asn 


Pro 


Gly 


Ser 
355 


Tyr 


Arg 


Cys 


Val 


Cys 
360 


Pro 


Pro 


Gly 


His 


Ser 
365 


Leu 


Gly 


Pro 


Leu 


Ala 

370 


Ala 


Gin 


Cys 


He 


Ala 

375 


Asp 


Lys 


Pro 


Glu 


Glu 
380 


Lys 


Ser 


Leu 


Cys 


Phe 


Arg 


Leu 


Val 


Ser 


Thr 


Glu 


His 


Gin 


Cys 


Gin 


His 


Pro 


Leu 


Thr 


Thr 


385 










390 










395 










400 


Arg 


Leu 


Thr 


Arg 


Gin 
405 


Leu 


Cys 


Cys 


Cys 


Ser 
410 


Val 


Gly 


Lys 


Ala 


Trp 
415 


Gly 


Ala 


Arg 


Cys 


Gin 
420 


Arg 


Cys 


Pro 


Ala 


Asp 
425 


Gly 


Thr 


Ala 


Ala 


Phe 
430 


Lys 


Glu 


lie 


Cys 


Pro 
435 


Gly 


Trp 


Glu 


Arg 


Val 
440 


Pro 


Tyr 


Pro 


His 


Leu 
445 


Pro 


Pro 


Asp 


Ala 


His 
450 


His 


Pro 


Gly 


Gly 


Lys 
455 


Arg 


Leu 


Leu 


Pro 


Leu 
460 


Pro 


Ala 


Pro 


Asp 


Gly 


Pro 


Pro 


Lys 


Pro 


Gin 


Gin 


Leu 


Pro 


Glu 


Ser 


Pro 


Ser 


Arg 


Ala 


Pro 


465 










470 










475 










480 


Pro 


Leu 


Glu 


Asp 


Thr 
485 


Glu 


Glu 


Glu 


Arg 


Gly 
490 


Val 


Thr 


Met 


Asp 


Pro 
495 


Pro 


Val 


Ser 


Glu 


Glu 
500 


Arg 


Ser 


Val 


Gin 


Gin 
505 


Ser 


His 


Pro 


Thr 


Thr 
510 


Thr 


Thr 


Ser 


Pro 


Pro 
515 


Arg 


Pro 


Tyr 


Pro 


Glu 
520 


Leu 


He 


Ser 


Arg 


Pro 
525 


Ser 


Pro 


Pro 


Thr 


Phe 
530 


His 


Arg 


Phe 


Leu 


Pro 
535 


Asp 


Leu 


Pro 


Pro 


Ser 
540 


Arg 


Ser 


Ala 


Val 


Glu 


He 


Ala 


Pro 


Thr 


Gin 


Val 


Thr 


Glu 


Thr 


Asp 


Glu 


Cys 


Arg 


Leu 


Asn 


545 










550 










555 










560 


Gin 


Asn 


He 


Cys 


Gly 
565 


His 


Gly 


Gin 


Cys 


Val 
570 


Pro 


Gly 


Pro 


Ser 


Asp 
575 


Tyr 


Ser 


Cys 


His 


Cys 
580 


Asn 


Ala 


Gly 


Tyr 


Arg 
585 


Ser 


His 


Pro 


Gin 


His 
590 


Arg 


Tyr 


Gys 


Val 


Asp 
595 


Val 


Asn 


Glu 


Cys 


Glu 
600 


Ala 


Glu 


Pro 


Cys 


Gly 
605 


Pro 


Gly 


Lys 


Gly 


He 
610 


Cys 


Met 


Asn 


Thr 


Gly 
615 


Gly 


Ser 


Tyr 


Asn 


Cys 
620 


His 


Cys 


Asn 


Arg 
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1 Gly Ala Gly Gly Arg Ser Cys V^^sp Leu 
625 630 635 640 

Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys lie 

645 650 655 

Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu 

660 665 670 

Lys Ala Ser Arg Pro Pro lie Cys Glu Asp lie Asp Glu Cys Arg Asp 

675 680 685 

Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe 

690 695 700 

Lys Cys He Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala 
705 710 715 720 

Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly 

725 730 735 

Trp Cys Glu Lys Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly 

740 745 750 

He Arg Thr Arg Thr Gly Arg Leu Ser Cys He Asp Val Asp Asp Cys 

755 760 765 

Glu Ala Gly Lys Val Cys Gin Asp Gly He Cys Thr Asn Thr Pro Gly 

770 775 780 

Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg 
785 790 795 800 

Ser Arg Cys Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys He 

805 810 815 

Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro 

820 825 830 

Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp 

835 840 845 

Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn 

850 855 860 

Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr 
865 870 875 880 

Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys Lys 

885 890 895 

Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu 

900 905 910 

Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly 

915 920 925 

Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala 

930 935 940 

Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin 
945 950 955 960 

Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu Cys 

965 970 975 

He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn Ser 

980 985 990 

Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly 

995 1000 1005 

Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 

1010 1015 1020 

Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
1025 1030 1035 1040 

Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu Ser Pro 

1045 1050 1055 

Glu Glu Met Glu His Ala Pro Glu Arg Arg Glu Val Cys Trp Gly Gin 

1060 1065 1070 

Arg Gly Glu Asp Gly Met Cys Met Gly Pro Leu Ala Gly Pro Ala Leu 
1075 1080 1085 
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w • 

Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Tyr Gin Cys 

1090 1095 1100 

Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
1105 1110 1115 1120 

Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 

1125 1130 1135 

Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 

1140 1145 1150 

Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 

1155 1160 1165 

Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 

1170 1175 1180 

lie Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 
1185 1190 1195 1200 

Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 

1205 1210 1215 

Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 

1220 1225 1230 

Ala Asp Asp Ala Ala lie Ala His Thr Ser Val lie Asp His Arg Gly 

1235 1240 1245 

Tyr Phe His 
1250 



<210> 86 
<211> 353 
<212> PRT 

<213> Homo sapiens 



<400> 86 



Met 


Ala 


Asn 


Asp 


Gin 


Lys 


Asn 


Ser 


Glu 


Ser 


Phe 


Pro 


Ala 


Lys 


Glu 


Asp 


1 








5 










10 










15 




His 


Lys 


Lys 


Asp 
20 


Asp 


Ala 


Ala 


Ala 


Pro 
25 


Ala 


Glu 


Val 


Asp 


His 
30 


Lys 


Asp 


Glu 


Phe 


Ser 
35 


Ala 


Ser 


Gin 


Pro 


His 
40 


Pro 


Val 


Glu 


Asn 


He 
45 


Val 


Leu 


Val 


Gly 


Arg 
50 


Thr 


Gly 


Asn 


Gly 


Lys 
55 


Ser 


Ala 


Thr 


Gly 


Asn 
60 


Ser 


He 


Val 


Arg 


Ser 


Lys 


Val 


Phe 


Lys 


Ser 


Lys 


Thr 


Lys 


Ser 


Ser 


Gly 


Val 


Thr 


Met 


Glu 


65 










70 










75 










80 


Cys 


His 


Ala 


Val 


Lys 
85 


Ala 


Val 


Thr 


Pro 


Glu 
90 


Gly 


Pro 


He 


Leu 


Asn 
95 


Val 


He 


Asp 


Thr 


Pro 
100 


Gly 


Leu 


Phe 


Asp 


Leu 
105 


Ser 


Val 


Ser 


Ala 


Glu 
110 


Phe 


He 


Gly 


Lys 


Glu 
115 


He 


Val 


Lys 


Cys 


Leu 
120 


Thr 


Leu 


Ala 


Asp 


Gly 
125 


Gly 


Leu 


His 


Ala 


Val 
130 


Leu 


Leu 


Val 


Leu 


Ser 
135 


Val 


Arg 


Thr 


Arg 


He 
140 


Ser 


Gin 


Glu 


Glu 


Glu 


Met 


Val 


Leu 


Ser 


Thr 


Leu 


Gin 


Val 


Leu 


Phe 


Gly 


Ser 


Lys 


He 


Val 


145 










150 










155 










160 


Asp 


Tyr 


Leu 


He 


Val 


Val 


Phe 


Thr 


Gly Gly 


Asp 


Val 


Leu 


Glu 


Asp 


Asp 










165 










170 










175 




Gly 


Met 


Thr 


Leu 
180 


Glu 


Asp 


Tyr 


Leu 


Gly 
185 


Asp 


Asn 


Met 


Pro 


Asp 
190 


Phe 


Leu 


Lys 


Arg 


Val 
195 


Leu 


He 


Leu 


Cys 


Gly 
200 


Gin 


Arg 


Met 


He 


Leu 
205 


Phe 


Asp 


Asn 


Lys 


Thr 
210 


Lys 


Asp 


Asp 


Glu 


Lys 
215 


Lys 


Thr 


Lys 


Gin 


Val 
220 


His 


Glu 


Leu 


Leu 
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Lys Leu lie Asp Leu Val Arg Lys Gin Asn Asn Asn lie Pro Tyr Thr 
225 230 235 240 

Asp Glu Met Tyr His Met He Lys Glu Glu Asn Glu Arg His Lys Lys 

245 250 255 

Glu Gin Glu Glu Leu Glu Ser Lys Gly His Ser Glu Glu Gin Leu Ala 

260 265 270 

Ala Leu Met Lys Glu Leu Gin He Met Asn Glu Arg Asn Leu Lys Ala 

275 280 285 

Met Ala Glu Met Met Glu Lys Asn Met Lys He Ala Met Glu Ala Gin 

290 295 300 

Glu Lys Leu Phe Glu Gin Arg Glu Lys Ala Gin Glu Met Ser Tyr Gin 
305 310 315 320 

Gin Lys Met Glu Met Gin Glu Lys Leu Lys Gin Met Glu Gly Arg Met 

325 330 335 

Arg Ala Glu Met Glu Ala Gin Met Leu Ser Arg Gin Gin Cys Ser He 
340 345 350 

Leu 



<210> 87 

<211> 4317 

<212> DNA 

<213> Homo sapiens 



<400> 87 

cctcctgctg tcccctccct acccttggct tctcgccccg ctctgccctc tgctaccaac 60 

actcgatccc ctgctcgggc tcgacctcca atctccgagg gtcgtgcggc cccggatgcc 120 

cgggccccga gcggtgccca cggcctggcc cctgcgatgc gccaggccgg cggattgggg 180 

ctgctggcac tactcctgct ggcgctgctg ggccccggcg gccgaggggt gggccggccg 240 

ggcagcgggg cacaggcggg ggcggggcgc tgggcccaac gcttcaaggt ggtctttgcg 300 

cctgtgatct gcaagcggac ctgtctgaag ggccagtgtc gggacagctg tcagcagggc 360 

tccaacatga cgctcatcgg agagaacggc cacagcaccg acacgctcac cggttctgcc 420 

ttccgcgtgg tggtgtgccc tctaccctgc atgaacggtg gccagtgctc ttcccgaaac 480 

cagtgcctgt gtcccccgga tttcacgggg cgcttctgcc aggtgcctgc tgcaggaacc 540 

ggagctggca ccgggagttc aggccccggc tggcccgacc gggccatgtc cacaggcccg 600 

ctgccgcccc ttgccccaga aggagagtct gtggctagca aacacgccat ttacgcggtg 660 

caggtgatcg cagatcctcc cgggccgggg gagggtcctc ctgcacaaca tgcagccttc 120 

ttggtgcccc tggggccagg acaaatctcg gcagaagtgc aggctccgcc ccccgtggtg 780 

aacgtgcgtg tccatcaccc tcctgaagct tccgttcagg tgcaccgcat cgaggggccg 840 

aacgctgaag gcccagcctc ttcccagcac ttgctgccgc atcccaagcc cccgcacccg 900 

aggccaccca ctcaaaagcc actgggccgc tgcttccagg acacattgcc caagcagcct 960 

tgtggcagca accctttgcc tggccttacc aagcaggaag attgctgcgg tagcatcggt 1020 

actgcctggg gacaaagcaa gtgtcacaag tgcccacagc ttcagtatac aggggtgcag 1080 

aagcctgtac ctgtacgtgg ggaggtgggt gctgactgcc cccagggcta caagaggctc 1140 

aacagcaccc actgccagga tatcaacgaa tgtgcgatgc ccgggaatgt gtgccatggt 1200 

gactgcctca acaaccctgg ctcttatcgc tgtgtctgcc cgcccggtca tagcttgggt 1260 

cccctcgcag cacagtgcat tgccgacaaa ccagaggaga agagcctgtg tttccgcctt 1320 

gtgagcaccg aacaccagtg ccagcaccct ctgaccacac gcctaacccg ccagctctgc 1380 

tgctgtagtg tgggtaaagc ctggggtgcc cggtgccagc gctgcccggc agatggtaca 1440 

gcagccttca aggagatctg ccccggctgg gaaagggtac catatcctca cctcccacca 1500 

gacgctcacc atccaggggg aaagcgactt ctccctcttc ctgcacccga cgggccaccc 1560 

aaaccccagc agcttcctga aagccccagc cgagcaccac ccctcgagga cacagaggaa 1620 

gagagaggag tgaccatgga tccaccagtg agtgaggagc gatcggtgca gcagagccac 1680 

cccactacca ccacctcacc cccccggcct tacccagagc tcatctctcg cccctcccca 1740 

cctaccttcc accggttcct gccagacttg cccccatccc gaagtgcagt ggagatcgcc 1800 

cccactcagg tcacagagac cgatgagtgc cgattgaacc agaatatctg tggccatgga 1860 

cagtgtgtgc ctggcccctc ggattactcc tgccactgca acgctggcta ccggtcacac 1920 

ccgcagcacc gctactgtgt tgatgtgaac gagtgcgagg cagagccctg cggccccggg 1980 
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aaaggcatct gtatgaacac tggtggctcc tacaattgtc actgcaaccg 
ctccacgtgg gtgcaggggg ccgctcgtgc gtggacctga acgagtgcgc 
ctgtgtgggg acggtggctt ctgcatcaac ttccctggtc actacaaatg 
cctggctacc ggctcaaggc ctcccgaccg cccatttgcg aagacatcga 
gaccctagca cctgccctga tggcaaatgt gaaaacaaac ctggcagctt 
gcctgccagc ctggctaccg tagccagggg ggcggggcct gtcgtgatgt 
tccgagggta ccccctgctc tcctggatgg tgtgagaacc ttccgggttc 
acgtgtgccc aggggatacg aacccgcaca ggacgcctca gttgcataga 
tgtgaggctg ggaaagtgtg ccaagatggc atctgcacga acacaccagg 
tgtcagtgcc tctccggcta tcatctgtca agggatcgga gccgctgtga 
gaatgtgact tccctgcggc ctgcatcggg ggtgactgca tcaataccaa 
agatgtctct gtcccctggg tcatcggttg gtgggcggca ggaagtgcaa 
gatgagtgca gccaggaccc aggcctgtgc ctgccccatg cctgcgagaa 
tcctatgtct gtgtctgtga tgagggtttc acactcaccc aggaccagca 
gaggtggagc agccccacca caagaaggag tgctacctta acttcgatga 
tgtgacagcg tattggctac caatgtcact cagcaggaat gctgttgctc 
ggctggggag accactgcga aatctatccc tgtccagtct acagctcagc 
agcctggtgc ctgatgggaa aaggctacac tcaggacaac aacattgtga 
cctgcccacc gtgacatcga cgaatgcata ttgtttgggg cagagatctg 
aagtgtgtga acacgcagcc cggctacgag tgctactgca agcagggctt 
ggcaacctgc tggagtgcgt ggacgtggat gagtgcttgg atgagtctaa 
ggagtgtgtg agaacacacg tggcggctac cgctgtgcct gcactccgcc 
agtcccgcac aggcccagtg tctgatcccg gagagatgga gcacgcccca 
aagtgtgctg gggccagcga ggagaggacg gcatgtgtat ggggcccctg 
gccctcactt ttgatgactg ctgctgccgc cagccgcggc tgggtaccca 
tgcccgccac gtggcaccgg gtcccagtgc ccgacttcac agagtgagag 
tgggacacaa gccccctgct actggggaag tctccgcgag acgaagacag 
gattcagatg agtgccgttg tgtgagcgga cgctgtgtgc cacggccagg 
tgcgagtgtc ctggaggctt tcagctggac gcctcccgtg cccgctgcgt 
gagtgccgag aactgaacca gcggggactg ctgtgtaaga gcgagcggtg 
agtggatcct tccgctgtgt ctgcaaagct ggcttcacgc gcagccgccc 
gcgtgcctca gcgccgccgc tgatgatgca gccatagccc acacctcagt 
cgagggtatt ttcactgaaa gtggagacag acaagtacat cctttgctcc 
agagcatgga cccaaggatc cttcagggcc cacaaatctc cttcccacac 
aggtgctcct gtctgcagag tgctgtctgc tttctcccaa gggtgattcc 
gacatcagat ctgccccttt aatttactct tggctttcaa ggcaaattga 
caaagcgggc agcatcaact gcttggcggg ttggactgag ctgggaccca 
tagaatttat tgtggctctg attatgtaca ctagatgtgc ctgacctgct 
acatggtttg tacaataaat acatccgccg ggaaaaaaaa aaaaaaaaaa 



aggctaccgc 
caagcctcac 
caactgctat 
cgagtgtcgc 
caagtgcatc 
caacgaatgc 
ttaccgttgc 
cgtggatgag 
ctctttccag 
ggacat tgat 
tggttcctac 
gaaagatata 
cctccagggc 
tgggtgtgag 
cacagtgttc 
tctgggagct 
cgaatt tcac 
actatgcatt 
caaggagggc 
ctactacgat 
ctgcaggaac 
ggcagagtac 
gagagacgtg 
ggcgggacct 
gtgcagaccg 
caattctttc 
ctcagaggag 

cggggcggta 

ggacattgat 
cgtgaacacc 
tcacgggcct 
gatcgatcat 
tgaccaaacg 
cccaaaccca 
tagaaacttc 
tattcacatc 
ggatgtgaaa 
gaccaggctc 
aaaaaaa 



2040 
2100 
2160 
2 2 2 0 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2 700 
2760 
2 820 
2 880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4317 



<210> 88 

<211> 299 

<212> PRT 

<213> Mus musculus 



<400> 88 

Ser Val Ser Trp Pro Asp Thr Ala Gin Gly Thr Met Ala Asn Leu He 

15 10 15 

Leu Thr Ala Leu Glu Lys Ala Thr Leu Phe Leu Glu Asp Arg Leu Pro 

20 25 30 

Thr He Asn Leu Asp Gly Val Val Gly Phe Gin Val Leu Glu Val Gin 

35 40 45 

Leu Arg Gly Val Gin Glu Lys Trp Ala His Lys Pro Leu Leu Gin Pro 

50 55 60 

Leu Ser Met Arg Ala Gly Gin Met Ala Asn Thr Leu Ser Ala Leu Leu 
65 70 75 80 
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f 

Phi T 



Gin Lys Ser lie Phe Tyr Leu Lys Gin Ser Asp Pro Thr Tyr Leu Arg 

85 90 95 

Glu Phe Gin Pro Ser lie Gin Pro Gly Phe Trp Lys Leu Pro Asn Asp 

100 105 • 110 

Trp Thr Arg Thr Asn Ala Ser Leu Val Tyr Pro Trp Leu Glu Pro Leu 

115 120 125 

Asp Ser Phe Ser Glu Glu Ser Ser Asp Val Cys Leu Val Gin Leu Leu 

130 135 140 

Gly Thr Gly Thr Asp Ser Ser Gin Pro Cys Arg Leu Ser Asn Phe Cys 
145 150 155 160 

Arg Thr Leu Met Thr Lys Ala Gly Cys Ser Gly Tyr Ser Leu Ser His 

165 170 175 

Gin Leu Leu Phe Phe Leu Trp Ala Arg Met Gin Gly Cys Thr Glu Gly 

180 185 190 

Leu Phe Leu Gin Ser Gin His Tyr Met Asp lie Phe Cys Ala Asn Met 

195 200 205 

Met Glu Leu Asn His Arg Ala Glu Ala Val Gly Tyr Ala Tyr Pro Thr 

210 215 220 

Gin Asp Leu Phe Met Glu Asn lie Met Phe Cys Gly Met Ala Gly Phe 
225 230 235 240 

Ser Asp Phe Tyr Lys Leu Arg Trp Leu Glu Ala lie Leu Ser Trp Gin 

245 250 255 

Asn Pro Gin Val Gly Cys Phe Gly Arg Pro Asp Thr Lys Gly Glu Pro 

260 265 270 

Ser Glu Val Pro His Gin Gin Gly He Leu Arg Arg Val Arg Arg Arg 

275 280 285 

Glu Lys Leu Phe Ala Asp Gly Cys Ser Cys His 
290 295 

;210> 89 
:211> 17 
:212> PRT 

:213> Mus musculus 
:400> 89 

Asn Thr Ala Thr Ala Val Ala Ala Leu Gly Gly Phe Leu Tyr He Leu 

15 10 15 

Ala 



:210> 90 
;211> 50 
:212> PRT 

;213> Mus musculus 
:400> 90 

Glu Tyr His Pro Asp Asn Gly Asp Ala His Pro Glu Tyr Tyr Pro Asn 

15 10 15 

His Gly Asp Pro Tyr Ser Ser Ser Gin Ser Pro Ala Ser Asn Tyr Gin 

20 25 30 

Asp Gly Ala Ala Gly Pro Asp Val Gin Arg Thr Gly Arg Pro Leu Ser 
35 40 45 

Val Ser 
50 

:210> 91 
c211> 199 
:212> PRT 
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400> 91 



Me t 






Lsu 




Leu 


ni \/ 
o X y 


f iltr 


Leu 


o X y 


Pro 


A 1 a 
MX d 


Ala 
MX d 


Cys 


Trp 


Ala 
M X d 










c 










X u 










i b 




Lieu. 


Gly 


Pro 


Ala 


Gly 


Pro 


Gly 


Ser 


Ser 


Glu 


Leu 


A T*0 

MX y 


Ser 


Ala 


Phe 


Ser 








2 U 










2 b 










J U 








a 1 3 


Arg 


i nr 


i ni 


Pro 


Leu 


LjX U 


p 1 \/ 
vj X y 


i nx 


oe r 


r* 1 n 
oXU 


ne c 


Ala 

MX a 


V a X 


i nr 
















^ U 










4 b 










Asp 


Ly s 


V d X 


lyr 


V d X 


As n 


T 1 P> 

X X C 


o X y 


^-j X y 


Asp 


rllfcr 


Asp 


Ala 
MX d 


Ala 
M X d 


1 nr 




5 0 










c c 
o b 










60 










Gly 


Aif^ 


flit; 


Arg 


y D 


Arg 


V d X 


r X U 


vj X y 


Ala 
MX d 


Tyr 


D Vi o 
flic 


file 


C ca 

o e r 


D Vi £1 

File 


1 n r 


O D 




















7 5 










8 0 


Ala 


vj X y 


Ly s 




Pro 


n X o 


Ly s 


As n 


Leu 


Q <a >- 
OCX 


V d X 




Leu 


V a X 


Arg 


As n 










o b 










Q n 










95 






Asp 


ni 11 


V d X 


m n 


A 1 ;a 
rtX d 


Leu 


Ala 
MX d 


D"h c» 
Ir lie 


Asp 


Ly s 


Pi n 

X n 


Arg 


Arg 


Pro 


P 1 T 

oxy 








inn 
X U U 










X U b 










110 






Ala 




Ar ^ 


A 1 
riX d 


A 1 
MXd 


o C X 


OX 11 


Ser 


Ala 
MX d 


M<=> t- 


Leu 


Pin 
Li X 11 


Leu 


Asp 


Tyr 


p 1 17 
lj ly 






115 










120 










125 








Asp 


Thr 


Val 


Trp 


Leu 


Arg 


Leu 


His 


Gly 


Ala 


Pro 


His 


Tyr 


Ala 


Leu 


Gly 




130 










135 










140 










Ala 


Pro 


Gly 


Ala 


Thr 


Phe 


Ser 


Gly 


Tyr 


Leu 


Val 


Tyr 


Ala 


Asp 


Ala 


Asp 


145 










150 










155 










160 


Ala 


Asp 


Ala 


Pro 


Ala 


Arg 


Gly 


Pro 


Ala 


Ala 


Pro 


Glu 


Pro 


Arg 


Ser 


Ala 










165 










170 










175 




Phe 


Ser 


Ala 


Arg 


His 


Ala 


Thr 


Trp 


Trp 


Ala 


Pro 


Asn 


Pro 


Pro 


Arg 


Pro 








180 










185 










190 






Ala 


Pro 


Arg 


Arg 


Leu 


Ala 


Phe 





















195 

210> 92 
211> 597 
212> DMA 

213> Mus musculus 



400> 92 

atgctgctgc tcttgctggg cttcctaggc ccggcggcct gctgggcact gggcccggct 60 

ggccctggct cctcggagct gcggtcagcc ttctcggcgg ctcgcaccac cccgctggag 120 

ggcacgtcgg agatggcggt gaccttcgac aaggtgtacg tgaacatcgg gggtgacttc 180 

gacgcagcca ccgggcggtt ccgctgtcgc gtgccgggcg cctacttctt ctccttcacg 240 

gccggcaagg ccccgcacaa aaacctgtcg gtgatgctgg tgcgcaaccg cgacgaagtg 300 

caggcgctgg ctttcgacaa gcagcgacgg ccaggcgcgc ggcgcgcggc cagccaaagc 360 

gccatgctgc agctcgacta cggcgacacg gtgtggctgc ggctgcacgg cgctccgcat 420 

tacgcgctcg gcgcgccggg cgccaccttc agcggctacc tggtgtacgc ggacgccgac 480 

gccgacgcgc ctgcgcgcgg gcccgcggcc ccggagccgc gctcggcctt ctccgcgcgc 540 

cacgccacct ggtgggctcc gaacccgccc cggcccgcgc cacggcgttt ggccttc 597 



210> 93 
211> 126 
212 > PRT 

213> Mus musculus 
400> 93 

Ala Phe Ser Ala Ala Arg Thr Thr Pro Leu Glu Gly Thr Ser Glu Met 

15 10 15 

Ala Val Thr Phe Asp Lys Val Tyr Val Asn lie Gly Gly Asp Phe Asp 
20 25 30 
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r\ J. d 




TVi r- 
i 11 X 


o 1 y 


Arg 


D V-i 
lie 


Arg 


Cys 

A Pi 


Arg 


V a 1 


Pro 


Lii y 


Ala 

Ai a 
45 


Tyr 


Phe 


Phe 




Pne 

D u 


Thr 


Ala 


Gly 


Lys 


Ala 

q c 
D O 


Pro 


His 


Lys 


Asn 


Leu 
60 


Ser 


Val 


Met 


Leu 


Val 


Arg 


Asn 


Arg 


Asp 


Glu 


Val 


Gin 


Ala 


Leu 


Ala 


Phe 


Asp 


Lys 


Gin 


Arg 


65 










70 










75 










80 


Arg 


Pro 


Gly 


Ala 


Arg 
85 


Arg 


Ala 


Ala 


Ser 


Gin 
90 


Ser 


Ala 


Met 


Leu 


Gin 
95 


Leu 


Asp 


Tyr 


Gly 


Asp 


Thr 


Val 


Trp 


Leu 


Arg 


Leu 


His 


Gly Ala 


Pro 


His 


Tyr 








100 










105 










110 






Ala 


Leu 


Gly 


Ala 


Pro 


Gly 


Ala 


Thr 


Phe 


Ser 


Gly 


Tyr 


Leu 


Val 







115 



120 



125 



75 



